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Introductory Quotation 


Ihave myself always thought of a mathematician as in the first instance an observer, a man 
who gazes at a distant range of mountains and notes down his observations. His object is 
simply to distinguish clearly and notify to others as many different peaks as he can .... 
But when he sees a peak he believes that it is there simply because he sees it. If he wishes 
someone else to see it, he points to it, either directly or through the chain of summits which 
led him to recognize it himself. When his pupil also sees it, the research, the argument, the 
proof is finished. 

The analogy is a rough one, but I am sure that it is not altogether misleading. If we were 
to push it to its extreme we should be led to a rather paradoxical conclusion; that there is, 
strictly, no such thing as mathematical proof; that we can, in the last analysis, do nothing 
but point; that proofs are what Littlewood and I call gas, rhetorical flourishes designed to 
affect psychology, pictures on the board in the lecture, devices to stimulate the imagination 
of pupils. This is plainly not the whole truth, but there is a good deal in it. 

—From Hardy [250, Vol. 7, p. 598]. 


To all women mathematicians, 
to my parents, 
to my POSSLQ. 
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Since its beginnings with Fourier (and as far back as the Babylonian astronomers), 
harmonic analysis has been developed with the goal of unraveling the mysteries of 
the physical world of quasars, brain tumors, and so forth, as well as the mysteries 
of the nonphysical, but no less concrete, world of prime numbers, diophantine 
equations, and zeta functions. Quoting Courant and Hilbert, in the preface to 
the first German edition of Methods of Mathematical Physics: “Recent trends 
and fashions have, however, weakened the connection between mathematics and 
physics.” Such trends are still in evidence, harmful though they may be. My main 
motivation in writing these notes has been a desire to counteract this tendency 
towards specialization and describe applications of harmonic analysis in such 
diverse areas as number theory (which happens to be my specialty), statistics, 
medicine, geophysics, and quantum physics. I remember being quite surprised to 
learn that the subject is useful. My graduate education was that of the 1960s. 
The standard mathematics graduate course proceeded from Definition 1.1.1 to 
Corollary 14.5.59, with no room in between for applications, motivation, history, 
or references to related work. My aim has been to write a set of notes for a very 
different sort of course. 

A second impulse pushing me toward the typewriter was the knowledge that in 
the past 30 years there have been some really exciting discoveries in the field of 
harmonic analysis on symmetric spaces and their fundamental domains for discrete 
isometry groups—the work of Harish-Chandra, Helgason, Langlands, Maass, Sel- 
berg, and others. It is time that these ideas received an exposition comprehensible 
to the average applied mathematician, number theorist, etc. In particular, I believe 
that many of the results to be described have interesting implications for statistical 
physics and number theory. 

The outline of the book can be sketched as follows. Chapter 1 concerns Euclidean 
Fourier analysis and its applications to the solution of the wave and heat equations, 
the study of potential functions of crystals, as well as zeta functions of algebraic 
number fields, for example. Chapter 2 deals with spherical Fourier analysis and 
its connections with the Euclidean theory. There are applications to CAT scanners, 
the solar corona, and the Zeeman effect for the hydrogen atom in a magnetic field. 


ix 
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Chapter 3 studies non-Euclidean Fourier analysis on the Poincaré or Lobatchevsky 
upper half-plane H, with an elementary discussion of the work of Harish-Chandra 
and Helgason in this special case. The main idea is to use the group invariance under 
SL(2, R)—the special linear group of all 2 x 2 real matrices of determinant one—to 
determine the spectral measure in the non-Euclidean Fourier inversion formula via 
the asymptotics of the special functions involved. Applications include the solution 
of the Dirichlet problem for cones, wedges, and other domains in Euclidean space, 
as was discovered by Mehler, Fock, and Lebedev long ago. One can also solve the 
non-Euclidean heat and wave equations on H itself. These results will be used to 
obtain a non-Euclidean central limit theorem with applications to the statistics of 
transmission lines. The results of Chap. 3 have different interpretations when one 
realizes that the group SL(2,R) can be identified with many other Lie groups; e.g., 
Sp(1,R) the group of 2 x 2 real symplectic matrices g such that ‘g/g = J if J = 


& ri ) , or the group SU (1,1) of 2 x 2 complex matrices g such that ‘gMg = M 


if M = F 7 ) . Finally, SL(2,R) is locally isomorphic to the Lorentz-type group 


SO(2,1) of real 3 x 3 matrices of determinant one preserving the quadratic form 
X +33 —x3. However, the higher-dimensional analogues of these groups are distinct. 

Non-Euclidean analogues of Fourier series also make their appearance in 
Chap. 3. This is the Fourier inversion formula for functions on H that are periodic 
under the group SL(2,Z) acting via fractional linear transformation with integer 
entries and determinant one. The methods of the first part of the chapter work in 
this case to find the spectral measure on the continuous part of the spectrum of 
the non-Euclidean Laplacian on H/SL(2,Z) just as they did for H itself. However, 
there is, in addition, a discrete part of the spectrum, which remains as mysterious as 
the quanta in quantum mechanics. Applications in this section include the solution 
of the non-Euclidean heat equation on the fundamental domain H/SL(2,Z), the 
computation of class numbers of imaginary quadratic fields, and Peter Sarnak’s use 
of the Selberg trace formula to say something about the asymptotics of units in 
real quadratic fields. Recently Hurt [308, 309] has described applications of the 
trace formula in quantum-statistical mechanics. Chapter 1 of Volume II, i.e., [667], 
will include another application—Kaori Imai Ota’s extension of the converse result 
in Hecke’s theory of the relation between Dirichlet series such as Riemann’s zeta 
function, and modular forms such as the theta function. Imai Ota extends the theory 
to Siegel modular forms of genus 2. Examples of such modular forms are the theta 
functions appearing in the study of abelian integrals. These theta functions arise in 
recent work on the Korteweg—DeVries equation, as well as in Sonya Kovalevsky’s 
solution of the third known case of the problem of the motion of a rigid body about 
a fixed point. 

Volume II, Chap. 1 will extend harmonic analysis to the symmetric space P,, of 
positive definite real n x n matrices Y as well as to the Minkowski fundamental 
domain for P,,/GL(n,Z), where GL(n,Z) is the modular group of n x n integer 
matrices of determinant +1 (with the action Y goes to ‘AYA, if *A = transpose of A). 
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This is the prototype of the general theory for the symmetric space of a noncompact 
semisimple (or reductive) Lie group. Applications of Volume II, Chap. | include the 
study of the Wishart distribution in statistics, integrals arising in statistical quantum 
mechanics, lattice packings of spheres and Hilbert’s 18th problem, and integral 
tests for convergence of sums over matrix variables. The theory that underlies the 
development of the higher-dimensional Selberg trace formula will also be discussed. 
The most important part concerns various matrix argument analogues of certain 
useful special functions such as spherical, gamma, Bessel, Whittaker, zeta, and L- 
functions. There are many thesis topics and intriguing open questions here, some of 
which are quite basic. Thus Volume II has a somewhat unfinished aspect. 

Volume II, Chap. 2 is much more crude. Perhaps it should be considered merely 
as a guide to the literature. My goal was to present an introduction to some 
of the work of Maass and Siegel, because this work had motivated much of 
the development in Volume II, Chap. 1 (and even some of Volume I, Chap. 3). 
Another stimulus was the hope that by fitting various examples into the general 
picture, one would derive a clearer understanding of the examples. There are also 
quite interesting relations between the various symmetric spaces and fundamental 
domains, some of which have been of great use in number theory. Here the key word 
is “base change.” 

I would like to thank those people who have discussed these notes with me over 
the years. It is becoming hopeless to name them all. I would also like to note my 
indebtedness to certain books and papers that influenced me greatly during the time I 
began to write this manuscript: Courant and Hilbert [111], Dym and McKean [147], 
Elstrodt [156], Gangolli [191,192], Hejhal [261,262], Helgason [276], Kubota 
[375], Lang [388], Lebedev [401], Maass [437,438], Mennicke [463], Minkowski 
[471], Selberg [569], and Siegel [597, 599, 600]. 

I feel now that I am probably quite far from reaching the goals that motivated my 
writing. Certainly the average engineer will not want to think about the intricacies 
of GL(n,Z) presented in Volume II, without some more convincing applications. 
Number theorists are just beginning to see the utility of that chapter. But I have to 
admit that the chapter is still in quite a preliminary state. The final word cannot yet 
be said. For this reason (not to mention the size of each volume), I have decided to 
split these notes into two volumes. 

I still hope that both volumes will be useful to a beginner in the subject. The book 
has been used in parts of mathematics graduate courses at U.C.S.D. and M.LT. since 
1978. These courses had names like Lie groups, harmonic analysis, mathematical 
physics, and number theory. The last revision was made at the Institute for Advanced 
Study, Princeton, during the first months of the year of Orwell. 

These notes do assume some things of the reader. At the beginning, advanced 
calculus (with a little measure theory) suffices, along with the ability to look at 
a partial differential equation or a Bessel function without flinching. By Volume 
II, more is required. The reader has to look at a matrix argument Bessel function 
without flinching. [Throughout these volumes I also assume that the reader is 
familiar with the standard facts of modern applied algebra such as the beginnings 
of the theory of groups, fields, and rings, the basic facts about the integers Z and 
Z[nZ.] 
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One final comment: many important details have been consigned to the exer- 
cises. This happened principally because the author likes participatory democracy. 
However, sometimes the author got carried away with the exercises. But I did try to 
include references to the proofs. Perhaps later [should I live another 100 years] an 
answer book will appear. 

I feel impelled to add a P.S. of warning to the reader. This author is incredibly bad 
at proofreading. So, if something appears weird in a formula, feel free to change a 
letter, insert a missing Fourier transform, correct my spelling or history, etc. I would 
be very grateful if you would send me a list of the errors that you have found. And 
I would like to thank those that have done so in the past. 


La Jolla, CA Audrey Terras 
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Almost 30 years have passed since Volume I appeared. Much has changed. I am 
retired. I am happy to have had 25 PhD students. My POSSLQ is now called a 
partner. I am not using a typewriter. You may read the book as an e-book on a tiny 
tablet. I have. More women receive PhDs in mathematics in the United States and 
go on to become full professors at top universities. Obama is president. Yeah! 

Some things, however, never change. Basic mathematics is as it ever was. The 
average person in the United States thinks of a mathematician as someone who can 
balance his or her checkbook. Publishers, bookstores, libraries, and universities are 
in trouble. My country is still involved in too many horrible wars. 

I tried not to change much in the first edition. I made the corrections I knew 
about and added some updates on new developments in harmonic analysis on 
symmetric spaces, keeping myself to those developments that fit in with the spirit 
of the original. No adeles here! I also added a few figures drawn by Mathematica or 
Matlab. Again, though I will correct what I know of errors in the first edition, new 
errors may have been introduced in TeXing the old book. And, yes, I am still bad at 
proofreading. So reader beware! 

Many advances have been made since 1985. We now have Mathematica, Matlab, 
Sage, Scientific Workplace, Google, Wikipedia. We can post math. papers on the 
web and people all over the world can read them. No more mailing of poorly typed 
preprints. But sadly, also, no more handwritten letters. Computers with the power 
of the supercomputers of yesteryear are now on our desks or even in our purses 
or jacket pockets. Star Trek communicators look clumsy compared with our cell 
phones. 
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In the new parts of this second edition, I will discuss wavelets, quasicrystals, 
modular knots, and also provide a glimpse at the way in which modular forms 
appeared in the proofs of some of the oldest conjectures in number theory. I will also 
consider finite analogues of symmetric spaces and their applications in computer 
science—Ramanujan graphs. 

Finally, I thank the makers of Scientific Workplace for making it easier for me to 
deal with this book. 


Encinitas, CA Audrey Terras 
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Chapter 1 
Flat Space: Fourier Analysis on R” 


There was no time to waste. It’s possible to grasp alef-null-sized collections once you’re 
in your aethereal body... but you need some to look at. My job right now was to generate 
infinities. 


I slipped out of my physical body and began running around the room, I did alef-null 
laps... and then the whole rest of the class joined in.... 
“What happened?” a kid with glasses and dandruff asked. “Did you hypnotize us?” 


“The main thing is that you saw infinity,” I said.... 
—From White Light, by Rudy Rucker, Ace Books; NY, 1980, pp. 248-249. Reprinted 
by permission. 


1.1 Distributions or Generalized Functions 


Formalism denies the status of mathematics to most of what has been commonly understood 
to be mathematics, and can say nothing about its growth. None of the “creative” periods 
and hardly any of the “critical” periods of mathematical theories would be admitted into the 
formalist heaven, where mathematical theories dwell like the seraphim, purged of all the 
impurities of earthly uncertainty. Formalists, though, usually leave open a small back door 
for fallen angels .... On those terms Newton had to wait four centuries .... Dirac is more 
fortunate: Schwartz saved his soul during his lifetime. 

—From Lakatos [384, p. 2]. 


In 1927 P. A. M. Dirac [135] introduced “a function” 6(x), which he postulated to 
have the property 


boo 
f (x)6(a—x)dx = f(a) (1.1) 
if f(x) is continuous near x = a. It is easy to see that no such function exists. 
However, delta has been legalized by Laurent Schwartz and others. And we shall 
sometimes find it very convenient to have access to this theory of generalized 
functions or distributions. Thus we begin by summarizing these results. The reader 
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who wants more details should consult any of the following references: Yvonne 
Choquet-Bruhat et al. [87], DeJager [124,125] Dieudonné [134], Friedlander [186], 
Gelfand and Shilov [206], Horvath [305], Korevaar [371], Maurin [458], Schwartz 
[565-567], Strichartz [643], Treves [684], and Vladimirov [706]. The concept of 
distribution is a very natural one in applied mathematics. It can be used to represent 
an impulse, point mass, or point charge, for example. It also provides a natural 
method to use in obtaining fundamental solutions of partial differential equations, 
as we shall see in Sect. 1.2. 

There are two definitions of distribution. One of these goes as follows. Let D 
be the space of test functions f : R” — C such that f and all partials of all 
orders of f are continuous and vanish off a bounded set. Then a distribution T 
is a continuous linear functional T : D — C. Here we need a topology or notion 
of convergence of sequences of test functions in order to say what we mean by 
the continuity of T. We say that a sequence of test functions converges if they 
all vanish off the same bounded set and all partials of the sequence converge 
uniformly to the corresponding partial of the limit function. Note: We do not 
ask that the convergence be simultaneously uniform for all partials. This rather 
intricate definition of convergence of test functions is designed to produce a very 
simple calculus of distributions. For example, all distributions will be infinitely 
differentiable. 


Exercise 1.1.1 (Distributions Generalize Functions). Suppose that f(x) is a 
locally integrable function; i.e., f is Lebesgue integrable over every bounded 
Lebesgue measurable subset of R”. Then f defines a distribution Tẹ via 


Ty(g) = | Feetaiax for any gin D. 


Here dx = the usual Lebesgue measure on R”. The exercise is to prove that Ty is 
indeed a distribution. You should also show that two locally integrable functions fi 
and f define the same distribution if and only if fı = f2 almost everywhere. 


Exercise 1.1.2 (The Dirac Distribution). Define (g) = g(0) for each test func- 
tion g in D. Show that delta is indeed a distribution (i.e., continuous and linear). 


The second definition of distribution presents T as an equivalence class of 
Cauchy sequences of locally integrable functions. Here again convergence is defined 
with respect to test functions. That is, f, converges to f means that f fag dx 
converges to f fg dx for every test function g. You can find more details in Korevaar 
[371], for example. 

The Dirac delta distribution corresponds to the following type of sequence, for 
example. 


Definition 1.1.1. A Dirac sequence of positive type K, : R” — C has the 
following properties: 


1.1 Distributions or Generalized Functions 3 


(1) Kn is integrable over R”. 

(2) Kn is nonnegative. 

(3) J Kn(x) dx=1. 

(4) For every €,6 > 0, there exists an N € Z* such that when n > N : 


K,,(x)dx < €. 


x28 


It is easy to show (see Lang [386, pp. 211-213]) that if f is bounded and 
continuous on R”, then the convolution (Kn*f) (x) = f Kn(x—y)f(y)dy approaches 
f(x) uniformly, as n goes to infinity, for x in any compact set. 


Examples of Dirac Sequences for R” = 


Example 1.1.1 (The Fejér Kernel). 


1 f sin(nzx) 2 fi <1 
Ky (x) = n sin(7x) , tOr |x| = 7) 
0, otherwise. 


This kernel can be used to show that the Fourier series of a continuous periodic 
function can be Cesaro summed to converge uniformly to the function (see Sect. 1.3 
and Lang [386, p. 233]). 


Example 1.1.2 (The Landau Kernel). 


cn(1—x?)", for |x| <1, 
L = = 
n(x) { 0, otherwise. 


This kernel can be used to show that any continuous function on a finite interval can 
be uniformly approximated by polynomials since L,*f is a polynomial (see Lang 
[386, p. 214]). 


Example 1.1.3 (The Gauss Kernel). Fort > 0, 


G;(x) = exp[—x’/(2t7)], as t approaches zero from the right. 


1 
t/2n 
This kernel is the fundamental solution of the heat equation in one space variable 


(see Sect. 1.2). It is also the density for the normal distribution of mean 0 and 
standard deviation vt. See Fig. 1.2 for some graphs of G;, for t between 0.1 and 1. 
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Exercise 1.1.3. (a) Check that the three preceding examples are Dirac sequences 
of positive type. Show that the convolution K, * f of any of these kernels with 
a continuous function f approaches f uniformly on compact sets as n goes to 
infinity. For Example 1.1.3, the index n is replaced by the positive real number 
t which approaches 0 from the right. 

Then show that any continuous function of period 1 can be uniformly 
approximated by polynomials P(x) as well as by trigonometric polynomials 
P(exp(2ziax)). 


(b 


wm 


To obtain Dirac sequences for R”, m > 1, take K,(x) = Ka (x1) -++ Kn (Xm) for 
(Fiesta): 

An example of a sequence approaching delta distributionally which is not a 
Dirac sequence of positive type is the Dirichlet kernel, used to show that the 
Fourier series of a continuously differentiable function converges to the function 
(see Exercise 1.3.1 of Sect. 1.3 and Lang [386, p. 237]). 

It is clear that Ty in Exercise 1.1.1 corresponds to a sequence of functions. Take 
every element of the sequence to be f itself. 

It is easy to define the restriction, sum, scalar multiple, translate, support, and 
derivative of a distribution. For example, the support of a function f is the smallest 
closed set outside of which f vanishes. We say that a distribution T is zero on an 
open set U in R” if T(g) = 0 for all test functions g with support in U. Then the 
support of a distribution is the smallest closed set outside of which the distribution 
is zero. 


Exercise 1.1.4. Show that the support of the Dirac distribution is {O}. 


Integration by parts says that if f’ is locally integrable and g is a test function of 
one variable, then 


Tpl) = | fe=— | fe! = -Til 


This suggests that the partial derivative OT /ðx; of a distribution T should be 
defined by setting 


(OT /dxi) (8) = —T(0g/0xi) 
for any test function g. One must check that —T(dg/0x;) has a continuous linear 
dependence on the test function g. This follows from the way that convergence of 


test functions was defined. Thus all distributions are infinitely differentiable and all 
mixed partials of distributions must be equal. 


Exercise 1.1.5 (The Distributional Derivative of the Heaviside Step Function Is 
the Dirac Delta Distribution on R” = R). Define the Heaviside function by 


AG) = 0, x <0, 
1, x >0. 


Show that H’ = 6, as distributions. 
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The following exercise is useful in the study of PDEs involving the Laplace 
operator A. Suppose that E is a fundamental solution of Laplace’s equation (also 
known as the free space Green’s function for the Laplacian ); ie. AE = 6 (a 
distributional differential equation). Then, if we are given a sufficiently nice function 
f and we seek to solve the PDE Au = f for the unknown function u, it will easily be 
seen that u = f*E (cf. Theorem 1.1.1, which follows). It is harder to find the Green’s 
functions satisfying various sorts of boundary conditions (see Garabedian [197]). 


Exercise 1.1.6 (The Fundamental Solution of the Laplacian in R”, m > 3). 

Show that in the sense of distributions: if A = 0? /dx7 +---+ 07/032, then 
A(||x||7") = (2— m)Smô, where Sm = area of unit sphere in R”. 

Thus the fundamental solution for the Laplacian is Ej, (x) = ¢m||x||7~", 


Cm = [Sm(2- m) E. 


Hint. Let T be the distribution of the function Em in the sense of Exercise 1.1.1. 
You must show that (AT) (g) = T (Ag) = g (0) for each test function g. Now T (Ag) = 
Cm J ||x||7 "Ag dx. To evaluate this integral, apply Green’s theorem to the region 
obtained by removing a ball of radius r from R". Then let r approach zero. In 
Sect. 1.2, we will learn another way to do this. 


Exercise 1.1.7 (The Area of the Unit Sphere in R”). Show that 
1 m 
area{x € R” | lal =1}=27(5) /T(m/2). 


Hint. Consider the integral of exp (—||x||”) in polar coordinates and recall Euler’s 
formula for the gamma function (see Lebedev [401 ]). 


It is useful to note a few general facts about distributions that we do not have 
the space to prove. There are three such results. 


Three Facts 


(1) Any distribution supported by {0} is a finite linear combination of partial 
derivatives of delta. 

(2) Any distribution with compact support extends to a continuous linear functional 
on the space C”(R”) of all infinitely differentiable functions on R”. Here 
convergence of a sequence of C” functions means uniform convergence of 
partials on compacta. 
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(3) Every distribution is locally the result of applying a distributional differential 
operator (with constant coefficients) to a continuous function. Here “locally” 
means: when restricted to test functions with support in a given compact set. 


Remarks On Proofs. 


(1) See Lang [387, p. 448]. 

(2) See Friedlander [186, p. 35]. 

(3) See Gelfand and Shilov [206, Chap. 2]. The proof can be sketched as follows. 
Now test functions with support in a compact set S are the intersection over p 
of functions which are continuously differentiable of order up to p on S. This 
dualizes to say that a p must exist so that T will be a continuous linear functional 
on functions continuously differentiable of order up to p. Then one uses the 
Hahn-Banach theorem, the Riesz representation theorem, and integration by 
parts. We shall use this result in the discussion of convolution and Fourier 
transform of distributions. 


One defines convergence of a sequence T, of distributions to a distribution T as n 
approaches infinity to mean T,(g) approaches T (g) for every test function g. When 
thinking of functions as distributions, as in Exercise 1.1.1, this type of convergence 
is often called weak. The Lebesgue dominated convergence theorem gives an 
easy condition for weak convergence of sequences of functions. It is of course 
much easier for a sequence of functions to converge in the sense of distributional 
convergence than to converge pointwise. Note that if K, is a Dirac sequence, then 
Tx, approaches 6 as n — œ. 

It is easy to see that the operations of differentiation and passing to the limit 
can always be interchanged distributionally. This is false, of course, for sequences 
of functions. Just as easily, one can see that very weak conditions guarantee the 
convergence of series of distributions (for example, see Schwartz [565, p. 97]). 
In Theorem 1.3.3 of Sect. 1.3, we shall see some examples of Fourier series of 
distributions. 

In general, it is impossible to define the product of two distributions without 
abandoning associativity, for we shall see that on R we have x6 = 0. But then 


(<:)3= and * (x6) =0. 


X 


Note also that if f is locally integrable, there is no reason for f? to be locally 
integrable; e.g., f(x) = |x|~!/. The nastier T is, the nicer S must be for TS to make 
sense. We shall confine ourselves to the following situation. 


Definition 1.1.2. If T is a distribution and @ is any infinitely differentiable function 
on R”, define the distribution aT by (a7T,g) = (T,ag), using the notation 
T(g) = (T,g), for any test function g. 
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Exercise 1.1.8 (Products of Infinitely Differentiable Functions œ and Distribu- 
tions T). 


(a) Show that 


(b) Show that aô = a(0)6. 


One must similarly make some restrictions in order to define the convolution 
of two distributions. The idea of convolution is quite important. For example, 
solutions to partial (and ordinary) differential equations are often convolutions (cf. 
Exercise 1.1.6 and the next section). In probability theory, the density function of 
the sum of two independent random variables is the convolution of the two density 
functions (see, Exercise 1.2.22 in Sect. 1.2 or Apostol [10, Vol. II, p. 552)). 

In order to figure out what the convolution of two distributions should be, we 
must recall the definition of convolution fxg of two functions f, g : R” —> C, 
assuming both of the functions are locally integrable and that at least one has 
bounded support: 


(Fes) (a) = [Fle -y)s0)dy= f FO- y)dy (1.2) 


Rm 


The convolution of f,g in L! (R”) = the Lebesgue integrable functions also makes 
sense. 

One can define the convolution of two distributions as the equivalence class 
of Cauchy sequences obtained by convolving the functions representing the two 
distributions, assuming that one of the distributions has bounded support. The 
definition as a continuous linear functional goes as follows. 


Definition 1.1.3. Suppose that S and T are distributions and that S has bounded 
support. Then define the distribution T « S as follows using the notation T(g) = 


(7,8): 
(T *S,g) = (T(y),(S(x),g(x+y)) for each test function g. 


Here we are abusing notation by writing distributions as if they were functions in 
order to keep track of variables. 


In order to see that the preceding definition of convolution of distributions makes 
sense, one must show that (S(x),g(x-+y)) is a test function as a function of y. 
To see this, use the fact that S is the distributional derivative of a continuous function 
(locally for test functions vanishing off a given compact set). 


Exercise 1.1.9. (a) Check that convolution of functions is associative and commu- 
tative, assuming that all the functions are in L!(IR”). Show that convolution of a 
Lebesgue integrable function with a differentiable function produces a differentiable 
function. 
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(b) Suppose that f and g are locally integrable functions and that g has compact 
support. Show that Ty. = (Ty)*(Zz), using the notation of Exercise 1.1.1. 


Theorem 1.1.1 (Properties of Convolution). 


(a) Regularization. The convolution of a distribution of bounded support and a test 
function is a test function. 

(b) Txô=T, Tx(32) = or with ae denoting the partial derivative of T with 
respect to Xj. 

(c) T*S = SxT if the support of S or of T is bounded. 

(d) (T*S)*R = Tx(S*R) if the supports of any two of these distributions are 
bounded. 

(e) 2 (T*5) = (xs = T+(35) if the support of S or T is bounded. 

(f) Suppose that T, approaches T in the sense of distributions. Then T,*S 
approaches T*S as n goes to infinity, provided that one of the following 
conditions holds: 


(i) the T, have uniformly bounded supports, 
(ii) S has bounded support. 


(g) Define the translate! of a distribution T by a vector a € R” as 
Ta(g) =T(g—-a) where ga(x) =g(x+a). 


Then T *6g = Ta and 64* 5p = Og+b- 
Exercise 1.1.10. Prove Theorem 1.1.1. 


It follows from Theorem 1.1.1 that every distribution is the limit of a sequence 
of test functions. To see this, note that there is a sequence of test functions 
Kn approaching the Dirac delta distribution. Therefore 7*K, approaches T as n 
approaches infinity. For we can find K, with uniformly bounded supports and apply 
part (f) of Theorem 1.1.1. 


Exercise 1.1.11. Show that the support of (S*7) is contained in the closure of the 
set of points x + y, with x in the support of S, and y in the support of T. 


1.2 Fourier Integrals 


Euclidean methodology has developed a certain obligatory style of presentation. I shall 
refer to this as “deductivist style.” This style starts with a painstakingly stated list of 
axioms, lemmas and/or definitions. The axioms and definitions frequently look artificial 
and mystifyingly complicated. One is never told how these complications arose. The list of 
axioms and definitions is followed by the carefully worded theorems. These are loaded with 


'This notation may be confusing since we are also using subscripts for many other purposes. 
Reader be alert! 
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heavy-going conditions; it seems impossible that anyone should ever have guessed them. 
The theorem is followed by the proof... . 

Deductivist style hides the struggle, hides the adventure. The whole story vanishes, the 
successive tentative formulations of the theorem in the course of the proof-procedure are 
doomed to oblivion while the end result is exalted into sacred infallibility. 

—From Lakatos [384, p. 142]. 


1.2.1 Fourier Transforms 


We want to understand the Fourier transform of a distribution on R”, but it makes 
sense to start off with Fourier transforms of Schwartz functions. Our study of 
Fourier transforms on Euclidean space will be sketchy. For more details, the reader 
could consult any of the following references: Benedetto [34], Bochner [43], 
Yvonne Choquet-Bruhat et al. [87], Dieudonné [134], Dym and McKean [147], 
Körner [372], Lang [386,387], Marks [453], Maurin [458], Schwartz [565,566], 
Stade [617], Stein and Shakarchi [635], Stein and Weiss [636], Strichartz [643], 
Titchmarsh [678], Zygmund [757]. 


Definition 1.2.1. The Schwartz space S is the space of all infinitely differentiable 
functions f : R” — C such that pepe | is bounded for all a,b € Z”, with a; > 0 
and b; > 0 for all j. Here we use the notation: a = (a ,...,@m), 


eS x! -xm and DÈ = _—-— av ; 
ý ax}! axhin (1.3) 


with |b] = bi +--+ bm. 
We will call such functions f Schwartz functions. 


Definition 1.2.2 (Fourier Transform of Schwartz Functions). Suppose f € S. 
Then define the Fourier transform f by 


fo) = [Fo exp (—2mi ‘xy) dx. 
R” 


Here we write x € R” as a column vector and ‘x as the transpose of that vector. Thus 
‘xy is the inner product of x and y. 


Note that the integral defining f (y) is easily seen to be convergent. 
Theorem 1.2.1 (Properties of the Fourier Transform on the Schwartz Space). 
(1) f € S implies that f € S. 
(2) D° (f) = ((—2zix)* f) , using the notation (1.3) for differential operators. 
(3) (Df) ^ = (2nix)“f. 
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(4) Convolution Theorem: (fre) = f-&, where convolution is defined by 
formula (1.2). 
(5) Translation: Set fa(x) = f(x+a) for a,x in R”. Then 


fa(x) = exp(2ni tax) f(x). 


(6) A Function That Is Its Own Fourier Transform: Let f(x) = exp(—n||x||*). 
Then f = f. 

(7) Dilation: Let u be a positive real number and set “ f(x) = f(ux) for x in R”. 
Then 


(8) Multiplication Formula: f fg = f fê. 

(9) Fourier Inversion: Define f~ (x) = f(—x). Then f= f~. 

All of the functions in the preceding statements are assumed to be Schwartz 
Junctions. 

Proof. Everything is an exercise except part (9). To prove (9), note that the operation 


of replacing f by f ~ commutes with translation. Thus it suffices to prove (9) at 
x = 0. Then property (8) shows that it suffices to prove (9) for a Dirac sequence of 
Schwartz functions such as the Gauss kernel from Sect. 1.1. For we have, assuming 


fina] fea [te =| re 


The left-hand side of this equality approaches f (0) and the right-hand side 
approaches f(0) as n goes to infinity, since K, approaches the Dirac delta 
distribution. 


Exercise 1.2.1. Prove property (2) of the Fourier transform by differentiating under 
the integral sign. Then prove property (3) by integration by parts. Property (1) then 
follows easily from (2) and (3). 


Exercise 1.2.2. Prove property (4) by changing the order of integration. 
Exercise 1.2.3. Prove properties (5) and (7) by making the right substitutions. 


Exercise 1.2.4 (Fourier Transforms of Gauss Kernels). Show that 


f(x) =exp(—al|x||), xE R", 


is its own Fourier transform by computing af (x) using integration by parts and then 
solving the resulting differential equation. Deduce property (6) of Theorem 1.2.1. 
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Then let G, be the Gauss kernel (Example 1.1.3 of Sect. 1.1). Show that G; = G; = 


G, using the seventh property of the Fourier transform. 


Exercise 1.2.5 (More Properties of the Fourier Transform). Show that if f and 
g are Schwartz functions: 


(a) f ++ f is one-to-one, linear from S onto S. 

(b) (fg) =f * 8. 3 

(c) Define an inner product for f,g in S by (f,g) = f f Z. Then (f,g) = (f,8) 
(Parseval’s identity). And, setting || f||2 = (f,f)'/*, we have the Plancherel 
identity || f||2 = || f||2- 


Mathematicians call the last statement of Exercise 1.2.5 the Plancherel the- 
orem. And we will see many generalizations of it in the succeeding chapters. 
Plancherel proved it in 1910. However, Rayleigh used this result in his study of 
blackbody radiation in 1889 (see Rayleigh [537]). Thus it might be more accurately 
called Rayleigh’s theorem. In later chapters this result will be equivalent to the 
determination of the spectral or Plancherel measure in the inversion formulas for 
the generalized Fourier transforms that we shall study. 

Now we should turn to Fourier integrals of nastier functions than Schwartz 
functions. The space of all Schwartz functions is dense in the space L?(IR”) of all 
Lebesgue square integrable functions on R”. See Apostol [11], Yvonne Choquet- 
Bruhat et al. [87], Dieudonné [134, Vol.II], Dym and McKean [134], Kolmogorov 
and Fomin [366], Korevaar [371], Lang [387], Maurin [458], or any of countless 
texts that treat the theory of the Lebesgue integral. The book by Dym and McKean 
is strongly recommended for this purpose. 

The continuous linear map of S onto S (using the L?-topology of Exercise 1.2.5) 
given by ff, has a unique continuous extension to a map on L? (R”). The 
extended mapping will be one-to-one, onto and a linear Hilbert space isomorphism 
of L? (R”). It is easy to show that if f € L? (R”), then f(y) is the L?-limit, as n goes 
to infinity, of the finite Fourier transforms: 


a f(x)exp(—2ri'xy)dx. 


Here you need the L* dominated convergence theorem. The inverse transform has a 
similar characterization. 

Wiener showed in [736] that it is also possible to give an elegant description 
of the Z? Fourier transform using Hermite polynomials. We will discuss this in 
Sect. 1.3. 

If f is in L! (IR); i.e., if f is Lebesgue measurable and || ||; = f |f] is finite, then 
the Fourier transform f(y) = f f(x) exp(—2zi 'xy)dx exists as a Lebesgue integral. 
We could not say this for L? functions. Thus it appears more natural to discuss 
Fourier integrals for L! functions. However, there is a problem with L! functions. 
The Fourier transform of an L! function need not be L!. A simple example on the 
real line is given by that in the following exercise. 
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Exercise 1.2.6 (A Function Where the Fourier Transform Is Not in L'(R)). 
Define 


0, |x| > 4, 
TI(x) = ¢ 3, |x| = 4, 
1, |x| < j: 


Show that TI(y) = sin(zy)/(zy). Then show that IT is not in L! (R). In fact, this 
gives us an example of a function that has an improper Riemann integral on R but 
is not absolutely integrable and thus is not Lebesgue integrable. 


Theorem 1.2.2 (Properties of the Fourier Transform of L'Functions). If f € 
L! (R”), we have the following facts. 


(1) || lee =1.u.b. |f(y)| < || fl], (.u.b. =least upper bound). 
(2) f: R” > C is continuous. 

(3) (faa) =f-8. i 

(4) Riemann-Lebesgue lemma: limy» f(y) = 0. 

(5) Î=0 < f =0. Here we mean 0 almost everywhere. 


Discussion. The proofs of these properties are not hard, using Lebesgue dominated 
convergence, Fubini’s theorem, etc. To prove the Riemann—Lebesgue lemma, note 
that given an L! function f, one can find a Schwartz function g that is close to f in L! 
norm. Then f(x) and g(x) must be close for all x. Now use the fact that ¢ approaches 
zero as x goes to infinity. More details can be found in Dym and McKean [147] or 
Stein and Weiss [636], for example. 

In part (5) of Theorem 1.2.2, we are saying that the Fourier transform defines a 
1-1 map of L! (R”) into the continuous functions on R” that vanish at infinity. The 
actual image is a very complicated subalgebra, which has not yet been characterized. 

You might well ask how to reclaim a function f in L!(R) from its Fourier 
transform. Since f need not be in L!, it is necessary to use some kind of summability 
procedure. The time-honored method for piecewise smooth functions is outlined in 
the following exercise. Other methods can be found in Dym and McKean [147, 
pp. 103-106]. 


Exercise 1.2.7 (A Dirac Family Associated to Fourier Inversion). Suppose that 
f € L! (R) and f is piecewise continuous (i.e., it can have at most a finite number of 
jump discontinuities in each finite interval). If the one-sided derivatives of f exist at 
x in R, show that 


1 fot) +f) = lim ( fip)exp(2ni)dy) 
ly|<r 


2 roo 


Se) wi wee. 


r= TUX 
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Here 


f(x+) = lim f(w) and f(x—-)= lim f(w). 


WXxX,W>x w—>x,w<x 


Hint. (cf. Exercise 1.1.1 of Sect. 1.1) Write 


F; Fe 
f(y) exp(2nixy)dy = | | exp(2miy(x—u)) f(u) du dy. 
ly|<r ly|<r Ju=—co 


Then interchange the order of integration. 
To complete the solution, you need the Riemann—Lebesgue lemma (Theorem 1.2.2, 
Part 4) and Exercise 1.2.14 below which says that 


° sinx T 
ao i 
0 x 2 


Note. Itis easy to extend Exercise 1.2.7 to R”, since 


[rcaen PO) exp(2mitay)dy = (FW), 


lyl<rj 


where 


m 
W,” (x) =[[W,,(xj) for xe R”, re (R*)". 
j=l 

Fourier analysis, however, did not originate in this textbook style of theorems 
and exercises. At the beginning, there were Fourier’s experiments on heat diffusion. 
And there was much controversy! For example, Lagrange prevented the publication 
of Fourier’s 1807 paper on the subject. See Grattan-Guinness and Ravetz [230] for 
the first published version of Fourier’s paper, as well as an interesting discussion 
of Fourier’s life and work. There was tremendous rivalry between mathematicians 
such as Fourier, Cauchy, and Poisson. Colossal arguments took place over the truth 
of the proposition that “any” function can be represented by its Fourier series. 
Mathematicians made insulting reviews of each other’s papers. Out of real physical 
questions and equally real mathematical difficulties came modern analysis—the 
Riemann and Lebesgue integrals and, later, distributions. See also Riemann [542, 
pp. 227-271], for the fascinating history of Fourier analysis. Other historical 
references are Burkhardt [72] and Hilb and Riesz [296]. Another interesting 
reference is I. Lakatos [384, pp. 128-131]. 


Example 1.2.1. [Heat Flow on an Infinite Rod] The problem of heat flow on an 
infinite rod can be posed mathematically as follows, making many simplifying 
assumptions about the rod: 

Find u(x,t), x € R, t > 0, satisfying the heat equation: 
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oe =at, for xER, t>0, 
u(x,0) = f(x), where f(x) = given initial heat distribution. 

For a derivation see Dym and McKean [147, p. 61], or Vladimirov [706, p. 31], 

for example. Or you may want to read Fourier’s original paper with its various 

approximations to the “correct” partial differential equation in Grattan-Guinness 

and Ravetz [230]. The constant a in the differential equation depends on the specific 

heat of the (uniform) material composing the rod. 

To solve the problem, formally Fourier transform both sides of the partial 
differential equation with respect to x. Use the fact that the Fourier transform takes 
differentiation to multiplication by 27ix (property 3 of Theorem 1.2.1). You obtain 

a = 47" x a û. 
Therefore, û = f exp(—477a7x°t). 

Using Exercise 1.2.4, we see that the Fourier transform of exp[—(27ax)t] as a 

function of x is 


G(x) = (2aV/nt)~| exp[—x?/(4a7t)], where G, = Gauss kernel, v= a(2r)!/?. 


Now use the convolution property of the Fourier transform (Theorem 1.2.2, Part 3) 
and Fourier inversion to see that 


u(x,t) = ii(—x,t) = Gy * f E (Goaz xf) (x). 
Here convolution is over the variable x. 


The entire procedure of the preceding example must now be justified, since we 
did not know at the beginning that it was legal to use Fourier transform properties of 
u(x,t), as it was an unknown function. Rather weak assumptions on f will suffice to 
make G a(21)1/2 (x) x f a solution of our problem, since the Gauss kernel is infinitely 
differentiable and a Schwartz function. Moreover, G, is a Dirac sequence of positive 
type as t approaches 0 from the right. We are using Exercises 1.1.3 and 1.1.10 from 
Sect. 1.1. 

We shall see at the end of this section that the Gauss kernel is also of central 
importance in probability and statistics. 


Exercise 1.2.8 (D’Alembert’s Solution of the Wave Equation). Consider the 
displacement u(x,t) of an infinite homogeneous vibrating string at position x on 
the real axis and time t. Making enough simplifying assumptions, one finds that 
u(x,t) must satisfy the wave equation (see, e.g., Dym and McKean [147, p. 62]): 


2 2 
ota a Th, for xeR, t>0, 
u(x,0) = f(x), f(x) = given initial heat distribution, 


SU (x, 0) = g(x), g(x) = given initial velocity. 
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Apply similar methods to those that we used in the heat equation example to obtain 
d’ Alembert’s solution: 


iG Tfaa) Leal af. swe 


2a J} x—at 


Now that we have briefly reviewed the theory of the Fourier transform for 
rather nasty functions, it is time to describe the theory of the Fourier transform for 
distributions. The multiplication formula (property 8 of Theorem 1.2.1) suggests 
that the Fourier transform 7 of a distribution T should be defined by the equation 
T (g) =T (8) for all test functions g. The only problem is that when g and @ are both 
test functions, then g must be identically zero, as we shall soon see. It is not, in fact, 
surprising that the support of ¢ should be like that of exp(277ix). 

The moral is that we cannot easily define the Fourier transform of every 
distribution T. Instead we must restrict ourselves to tempered distributions. There is 
another way out of this dilemma which makes 7 a new sort of generalized function 
(see Gelfand and Shilov [206]). 


Definition 1.2.3. A distribution T is said to be a tempered distribution if it 
extends to a continuous linear functional on the space S of Schwartz functions. 


In order to understand what we mean by continuity in the definition above, we 
must define what we mean by convergence of a sequence of Schwartz functions 
gn to a Schwartz function g. This means that all of the sequences x“D’ (gn — g) 
converge to zero uniformly on R” for all a,b in (Z+ )”, using notation (1.3) which 
was set up at the beginning of this section. 


Exercise 1.2.9. Show that the Fourier transform defines a continuous linear map 
from S onto S, using the preceding definition of continuous. 


Definition 1.2.4. Suppose that T is a tempered distribution. Define the Fourier 
transform 7 by T(g) = T (8), for all Schwartz functions g. 


Exercise 1.2.10 (Distributions Tempered and Not So Tempered). 


(a) Show that the following distributions are all tempered: 
the distribution of a bounded or Lebesgue integrable function, 
any distribution with bounded support. 

(b) Show that e* does not define a tempered distribution on R. 


Theorem 1.2.3 (Properties of the Fourier Transform of Tempered Distribu- 
tions). 


(1) The Fourier transform of a tempered distribution T with bounded support is a 
function: 


V(s) =T (exp(—2mi'sx)). 
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Moreover, V may be continued to all complex vectors s € C” as an entire 
function with bound 


IV(s)| < Cesl (1 + Jsl"), 


for some A and N. The converse of this theorem is also true and is called the 
Schwartz—Paley—Wiener theorem. 

(2) Fourier Transform Exchanges Differentiation and Multiplication by 
Polynomials. Using the notation introduced in the definition of the Schwartz 
space, we have: 


D° (Î) = ((—2mix)"TY and (DT) = (2nix)*P. 


(3) The Convolution Theorem: Suppose that S and T are two distributions with 
bounded supports. Their Fourier transforms are functions and 


(SxT) = ST. 


(4) Interchange of Unit for Convolution and Unit for Multiplication: 


A 


o=1. 


(5) Fourier Inversion: Suppose that T is a tempered distribution and g is a 
Schwartz function. Define T~ by T~(g) = T(g~), where g7 (x) = g(—x) for 
Schwartz functions g. Then 


ray, 
A Few Proofs. 


(1) Note that V(s) makes sense because T has compact support and thus extends to 
the space of infinitely differentiable functions. For the same reason, there is a 
constant coefficient differential operator D with T = DF for some continuous 
function F. Then D is a sum of terms involving differential operators L of even 
or odd order. This allows us to move L around to g with a +. Thus, for Schwartz 
functions g, we have (setting T(g) = (T,g)): 


(7,8) = (7,8) =) (LF, 8) 
= Lt fF) (4 fexo(—2mi 'sx)g(s)as) dx 


= ((T(x),exp(—2mi'sx)),g(s)), 


upon interchanging orders of integrals. Proofs of the Paley—Wiener theorem can 
be found in Dym and McKean [147] and Schwartz [566, Chap. VI]. 
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Table 1.1 A short table of Fourier transforms of one variable 


f(x), xER f(y) 
ee ey 
ae TF 
0, |x| > 1/2 
T(x) = 1/2, x=1/2 an 
1, |x| < 1/2 
ô(x) 1 
5 (6(x— 5) + 5(x+ 5) cos(zry) 
T 
2/7 | ny |572 
=; ylo|  K-1@zlyal), y#0 
(+x) °,q40, Res>0 MX) | 4 À H bal) 
r(2)F(s-7) 
T 2 gq, y=0 


(3) 
(S*T,g) = (S*T,8) = (S(y), (T(x), 8(x+y))) 
= (S), (T (x), (exp(—2miyx)g(x)) ~)) 
= (F(x), g(x) (S(y),exp(—2zi‘yx))) = (S(x)F (x), g). 


Exercise 1.2.11. Prove parts (2), (4), (5) of Theorem 1.2.3. 


It is clear from part (1) of Theorem 1.2.3 that if g and ê are both test functions, 
then g must be zero everywhere, since a complex analytic function of one variable 
which is supported on a bounded infinite set must be identically zero (see Ahlfors 
[3, p. 127]). 


Exercise 1.2.12. Verify Table 1.1. 


Hint. For the last formula, you need some properties of the gamma and K-Bessel 
functions (see Lebedev [401] or see the first exercise in the second section of 
Chap. 3). 


In the (good?) old days one needed a long table of Fourier transforms such 
as Erdélyi et al. [165] or Oberhettinger [500,501]. These days many computer 
packages such as Mathematica have such tables built in. 


1.2.2 Applications to Partial Differential Equations 


Next let us consider some applications of Fourier transforms of distributions to 
partial differential equations. The general idea is the same as that which we used 
to solve the heat equation earlier. Suppose that D is a constant coefficient partial 
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differential operator and S is a tempered distribution. To solve the equation 
DT =S 
for T, try Fourier transforming the equation. You get 
DT = MT =$, 
where M is multiplication by a polynomial. So you want to write 
7 =S/M. 


Then Fourier inversion will find T. This makes perfect sense if the polynomial M 
never vanishes. But otherwise there is much work to do in order to legalize division 
by M. This has been done by Hérmander [303] and Lojasiewicz [426]. See F. G. 
Friedlander [186, pp. 139ff], for a discussion of the theorem of Malgrange and 
Ehrenpreis to the effect that DE = 6 can be solved for E. 

In the examples that follow we will Fourier transform the PDE with respect to 
some subset of the variables. This will reduce the problem to an ODE. For more 
examples of this type, see Vladimirov [706]. 


Exercise 1.2.13 (Fundamental Solution of the Heat Equation). We want to solve 


E 
2E L PAE = 5(x,1), xeR”,t>0, A= — ++. 


Claim. E(x,t) = (4a? nt) ™?H (t) exp[—||x||?/(4a2r)], where H is Heaviside’s step 
function 


a(t) =f L120 
0zż<0. 


Hint. Imitate Example 1.2.1. Use ô= l and the fact that the distributional ordinary 
differential equation 


S'+cS = 6(t) = H' (t) 


has the solution S(t) = H (t)exp(—ct). Prove this too. 


Remarks. Note that E(x,t) in Exercise 1.2.13 is positive for all positive ¢ at every 
point x in space. Thus heat must be diffused with infinite velocity! This means 
that the heat equation does not appear to be a realistic model of heat transfer. 
The transport equation appears to be better, but we shall not discuss it here (see 
Vladimirov [706]). 


Example 1.2.2 (Fundamental Solution of Laplace’s Equation in RÌ). We want to 
solve the following PDE in R? : 
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AE = ô. 


Claim. E(x) = —(4z]|x||)7!. 


Discussion. Formally Fourier transform the PDE with respect to x. One obtains: 
—4n7||x||?E = 1. It is possible to justify thinking of ||x||~? as a tempered distribution 
since the behavior at 0 and œ is not really so bad. Thus E(—x) is the Fourier 
transform of (—477||x||?)~!. Suppose that g is a Schwartz function and write, as 
usual, T(g) = (T, g). Then 


(E(—x),8) = ((—427||x||?)~*, 8) 


= tim | (—4n2|[x||2)7! J g(u) exp(—2mi'xu)du dx. 


Re J \x|<R 


Interchange the order of integrals and use polar coordinates on the integral over the 
x variable. Then the x integral becomes: 


25 pe pe 
- | 7 exp(—2zi||u||rcos@) sin @d@dr 
r=0 J0=0 


1 R' sinr 
. (R' = 2n|lu||R 
ah, ear Œ = mele) 


By Exercise 1.2.14, this completes the proof that (E(x), g) = (—(42||x||)~!,g). 


Exercise 1.2.14 (An Improper Riemann Integral That Is Not Absolutely Con- 
vergent). Show that 


Explain why the integral does not converge absolutely. 


Exercise 1.2.15 (Fundamental Solution of the Wave Equation in R*). Define 
the simple surface layer distribution for a sphere 


Sr={xeER?| |x| =R}, 
by 


Ose (g) = P g(x)dA, dA= surface area elementon Sr. 
xESR 


Then show that the solution to the distributional equation OT = 6(x,t), where 
xeR?, t € R, and 
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(= ¢(5+5+5)) 
oN or ox ox ox , 


is T = (4ma2t)~! H(t) ős, (x). 


Remarks. The last exercise can be used to derive Kirchoff’s classical formula for 
the solution of the Cauchy problem: 


{ au = h(x,t), x€R3,t>0, 
u(x,0) = f(x), u(x,0) = g(x). 


assuming that f,g,h are all sufficiently smooth (see Vladimirov [706]). 


Huyghens’ principle also follows from the last exercise, since a disturbance 
6(x,t) propagates as a spherical wave along the spherical surface ||x|| = at, moving 
with speed a. After the wave passes a given point, there is no more disturbance. This 
principle fails in R and in even dimensional Euclidean space (see Exercise 1.2.8 and 
Vladimirov [706] or Dym and McKean [147]). Clearly, Flatland would be a very 
noisy place! 

It is possible to give many interpretations of the Fourier inversion formula 
that will aid us in our search for non-Euclidean analogues. The most important 
interpretation for this book is that Fourier inversion provides a spectral resolution of 
the Laplace operator on R”. That is, we are investigating the spectral theorem for 
a very special unbounded self-adjoint operator A on the Hilbert space 2? (R”). 
For the Fourier inversion formula says that a nice function f : R” — C has a 
representation as an integral of elementary eigenfunctions 


a(x) = exp(2mi'ax), a,x € R”, (1.4) 
of the Euclidean Laplacian: 


oe ə? 


For if f : R” — C is Schwartz, we have the spectral resolution 


s= f fody with f0)=(fe)= h Sde a9 


mE 
Note that the eigenfunctions e4 are not in the L?-space involved. This makes the 
elaboration of the spectral theorem rather intricate (see Maurin [458]). It is our 
aim to obtain analogues of Fourier inversion for symmetric spaces such as the non- 
Euclidean upper half-plane of Chap. 3. We give more information on the spectral 
theorem in Sect.2.2. For a discussion of the general spectral theorem the reader 
could also look at Maurin [458], Reed and Simon [540], or Yosida [748]. 

Another possible interpretation of the Fourier inversion formula comes from the 
theory of group representations. In this theory one views the eigenfunctions e,(x) 
in the preceding paragraph as irreducible unitary representations of the additive 
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group R”, since ea(x +y) = ea(x)ea(y). We will not emphasize this aspect here, 
although we give an introduction to group representations in Sect. 2.2. If the reader 
is interested in group representations there are many good references, such as Dym 
and McKean [147], Hamermesh [246], Knapp [356], Mackey [442-444], Talman 
[655], Michelle Vergne [697], and Vilenkin [704]. 

Exercise 1.2.16 (Heisenberg’s Uncertainty Principle). Suppose that || f||2 = 1. 
Then since || f||2 = ||f|l2, both [f(t)|? and fw)? can be viewed as densities of 
probability distributions. Translation of f just leads to a phase shift in f (and 
similarly for translation of f). So we can assume that the means vanish; i.e., 


farcorar =0 and fri aw =0. 
The variances are then 
o = f PIO Par and az= fw |f| aw. 


Then o; measures the time duration of the signal f(t) while o, measures the 
frequency spread. The problem is to prove that 


0; Oy > 1/(47). 


Prove also that equality holds for f(t) =Cexp(—at’), with a > 0, C chosen to make 
|| f|l2 = 1. You may assume that the functions involved are Schwartz functions. Then 
use Theorem 1.2.1, part (3) and Exercise 1.2.5, part (c) to see that it is equivalent to 
show that 


[itoa f Ip (Pare 7. 


See Dym and McKean [147, p. 119], for the details when f is only square integrable. 


1.2.3 Laplace Transforms 


Many engineers prefer to use the Laplace transform rather than the Fourier 
transform. 


Definition 1.2.5. The (one-sided) Laplace transform £ of a function f : Rt — C 
is defined for s € C by 


oo 


(EP) = | Fedt. 


0 
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One advantage is that Cf tends to exist when 7 does not; for example, when 
f is merely bounded or even exponentially increasing, assuming that the real part 
of s is sufficiently large. Another advantage is that one can make use of the power 
of the theory of complex variables (see, e.g., Exercise 1.2.20). It is also possible 
to define Laplace transforms of functions of several positive real variables and of 
distributions. See Doetsch [136], Erdélyi et al. [165], Folland [182], Oberhettinger 
and Badii [502], Schwartz [565], Sneddon [609], or Widder [735] for more 
information about the Laplace transform. Programs such as Mathematica will 
compute Laplace transforms. 


Exercise 1.2.17 (Properties of the Laplace Transform). We write Lf for Lf(s). 


(a) Existence. We shall say that a function f : Rt — C is admissible if it is 
piecewise continuous on every finite interval and if there are constants a in 
R and M in R* such that 


|f(x)| << Mexp(ax) forall x in R”. 


Show that if f is admissible, then £L f(s) exists when Res > a. 
(b) Derivatives. Suppose that f and f’ are admissible. Then show that 


Lf) =s(Lf)—F0) and (EA) =L(—xf a) 


(c) Integration. Suppose that f is admissible and that | f (x) /x| < Cx®~!, for € > 0. 
Then show that 


f Erodo = £0F0)/2) and e( f roa) LEA. 


S 


(d) Convolution. Suppose that f,g, and fg are all, admissible. Then prove that 


CPOE) =L( f fe-s), 
LU) = zi fg ENOC -ds 


Exercise 1.2.18 (Inversion of the Laplace Transform). Suppose that e~“ f(x) 
lies in L! (R) and f(x) vanishes for negative x. Assume also that f(x) is piecewise 
differentiable. Use Exercise 1.2.7 to show that 


1 1 ctir 
5 CCH + FG) = lim z fe cp (s)ds. 
Exercise 1.2.19. Compute the Laplace transform of 


(a) xf, Re(a) > —1. 
(b) ô(x—a),a>0, 


1.2 Fourier Integrals 23 


(c) the square wave of period a which has the values 


es Openers 


0,a/2<t<a. 
Here f(t +a) = f(t) for all t. 


Exercise 1.2.20 (Fourier Series via Laplace Inversion). 


(a) If f(x) is periodic, with period a > 0; i.e., f(x +a) = f(x), show that 


(cA) = 0- | Fladexp(—sx)de. 


(b) Use (a) to obtain the Fourier series expression for a periodic function f, 
assuming that the function f(x) is sufficiently nice. 


Hint. See Sneddon [609, pp. 166-169]. For part (b) use Cauchy’s residue theorem 
to see that the Laplace inversion integral gives a Fourier series via part (a). 


The Laplace transform can be used in the same sort of way as the Fourier 
transform to solve differential equations. It has also seen much use in the study of 
the asymptotics of functions. For example, it is useful in number theory where one 
wants to study asymptotic properties of the sequence of prime numbers. Similarly, 
as we shall see, it allows one to study the asymptotic properties of eigenvalues of 
the Laplace operator, by making use of solutions of the heat equation. The Laplace 
transform results needed here are called Tauberian theorems. Such theorems involve 
Laplace-Stieltjes transforms: 


[ ” exp(—st)da(t), 
0 


with œ of bounded variation and normalized to make a(0) = 0 and a(x) = 
(a(x+) + o(x—))/2. See Apostol [11] for a treatment of Riemann-Stieltjes 
integrals. Before thinking about Tauberian theorems, however, one should consider 
the following. 


Theorem 1.2.4 (An Abelian Theorem). Suppose that f(s) = Jy exp(—st)da(t) 
for s > 0, where a is a normalized function of bounded variation and 


a(t) ~At°/JT(c+1), ast— œ (ort — 0+). 
Then 
f(s) ~As~*, ass—>0+ (ors—> œ). 


The Abelian theorem is not hard to prove (e.g., see Widder [735]). Its converse 
is not true without extra hypotheses. Such a converse follows. 
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(Xo: Y0) 


(x,y) 


(0, 0) 


Fig. 1.1 A particle sliding along a curve under the influence of gravity 


Theorem 1.2.5 (A Tauberian Theorem). Suppose that a(t) is a nondecreasing 
normalized function of bounded variation. And suppose that 


f(s) = f exp sidat) 
0 
converges for all s > 0. If for some nonnegative number c 


f(s) ~ As, ass—>0+ (or soc), 
then 
a(t) ~ At /T(c+1), ast— œ (or t— 0+). 


There is a proof of the preceding theorem in Widder [735, p. 192]. We will see 
an application of this result in the next section. 


Exercise 1.2.21 (Abel’s Integral Equation: The Tautochrone). Consider a parti- 
cle of mass m sliding along a curve under the influence of gravity (with no friction), 
starting at (xo, yo) and ending at (0,0), as pictured in Fig. 1.1. Let T (yo) be the time 
for the particle to fall from (xo, yo) to (0,0), assuming the shape of the curve is given 
by some function y = f(x). Then conservation of energy says 


YO 


dx 2\ 1/2 
V28T (yo) = o9 O)O0- 3) "dy, #0) = (+(¥) | 


where g is the acceleration of gravity and the curve is assumed to look like that in 
Fig. 1.1. Derive this integral equation. 

Now assume that T = Tọ = constant; i.e., that the time of descent is independent 
of the starting point. The tautochrone problem is to find the curve y = f(x) under 
this hypothesis. Solve this problem by taking the Laplace transform of both sides of 
the integral equation. Then use the convolution property of the Laplace transform 
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plus the fact that £(y~!/*) = (a/s)'/?. The result is that 9! (y) = (2gTn-?y7!)!/2. 
This gives a differential equation for x(y), which has a cycloid for its solution. 


1.2.4 Application to Probability and Statistics: The Central 
Limit Theorem 


The Fourier transform can also be used to derive the central limit theorem, which 
is one of the most fundamental results in probability and statistics. Special cases of 
this theorem go back to DeMoivre and Laplace. The general result was proved by 
Lindeberg in 1922. Previously Liapunov, Chebychev, and Markov had proved the 
theorem under more restrictive hypotheses. The proof using Fourier transforms was 
developed from 1925 to 1940 using P. Lévy’s proofs of properties of Fourier trans- 
forms of probabilistic density functions called characteristic functions. References 
for probability and statistics are Cramér [115], Feller [177], Kolmogorov [365], and 
Loéve [425]. Collections of papers on applications to physics are Bharucha-Reid 
[38] and Wax [723]. 

Let us give a brief summary of the main definitions. For a historical perspective 
on the subject, see M. Kac [331]. For example, Kac recalls that in 1936 “independent 
random variables were to me (and others, including my teacher Steinhaus) shadowy 
and not really well-defined objects.” 

Let S be a measurable set in some Euclidean space, Let P be a probability 
measure on the Borel sets in S. This means that P(S) = 1. A random variable 
X : S — R is a measurable function. The distribution function Fy of X is defined by 


Fx(t) = P({w € S| X(w) <t} =P(X <0). 
If 


ROR) = T "ptd 


for some nonnegative measurable function fx, we call fy the density function 

for X. The density function can be obtained via the Radon—Nikodym theorem 

provided that the probability distribution on R is absolutely continuous with respect 

to Lebesgue measure. Otherwise one can use the Lebesgue decomposition theorem 

to split the probability distribution on R into a sum of an absolutely continuous and 

a singular part (see Lang [387], Feller [177, Vol. II, pp. 138—143], or Loéve [425]). 
Given two random variables X,Y, their joint distribution is 
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F(a,b)=P{weS|X(w)<a and Y(w) <b} =P(X <a, Y <b). 


We say that X and Y are independent if F (a,b) = Fx (a)Fy (b). 
If X is a random variable, we say that the expectation or mean value of X is 


The variance of X is 


o? = Var(X) = E(X—w)*)= fr wf (0dr, 


—oo 


where o is the standard deviation. 


Exercise 1.2.22. Suppose that X and Y are independent random variables with 
density functions fy and fy. Prove that the density function of the random variable 
X +Y is the convolution fy * fy, where convolution is defined by formula (1.2) of 
Sect. 1.2. 


The characteristic function y of a random variable X is the Fourier transform 


: EE = 
Px) =E(e™) = [el fx (udu = fx(-1/22). 


co 


Theorem 1.2.6 (Properties of Characteristic Functions). Suppose that X is a 
random variable with density function fx = f. Then f € L! (R), f > 0, f f(x)dx=1 
and the characteristic function of X, which is p(t) = f(—t /2r), has the following 
properties. 


(1) The function @ is continuous, p(0) = 1, and |ọ(t)| < 1 for allt. 
(2) The random variable aX + b, for a,b € R, has the characteristic function 


Pax +o(t) = e™ x (at). 


(3) If X and Y are two independent random variables, then the characteristic 
function of the sum X +Y is 


Px+ry (t) = Ox (t) Or (t). 


(4) Fourier inversion. Suppose that @ is the characteristic function of the random 
variable X and suppose that @ € L! (R). Then X has a bounded continuous 
density function f given by 
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6 4 -2 2 4 6 


Fig. 1.2 Graphs of normal density curves Go(x,0), from formula (1.6) with o = 7/10,I = 
1,2,...,10 


Exercise 1.2.23. Prove Theorem 1.2.6. 
Hint. See Feller [177, Vol. II, Chap. XV]. 


Exercise 1.2.24. Suppose that X and Y are two independent random variables. 
Show that Var(X +Y) = Var(X) + Var(Y). 


A random variable X is said to be normally distributed or Gaussian with 
parameters m and o or normal (m, o) if the density function of X is 


Go(x,m) = o~! (2r)! exp[—(x—m)?/(207)]. (1.6) 


We have graphed some of these density curves in Fig. 1.2. 


Exercise 1.2.25 (Properties of Normal Distributions). Assume that X is normal 
(m, ©); i.e., that its density is Gg(x,m) from formula (1.6). 


(a) Show that E(X) = m and Var(X) = o°. 
(b) Prove that 


P(|X -m| > 20) = exp(—x° /2)dx. 


2 oo 
vV2m l 
(c) Show that the characteristic function of X is ox (t) = exp[imt — (ot)? /2]. 


(d) Show that the sum of any two independent normally distributed random 
variables is itself normally distributed. 
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Theorem 1.2.7 (The Central Limit Theorem). Suppose that {Xn}n>1 is a 
sequence of independent random variables, each having the same density function 
f(x). Suppose that the mean is 0 and the standard deviation is 1; i.e., 


| oax =1, frfa =0, and feta =1. 


Then (Xı +++: +Xn)/./n is nearly Gaussian for large n; i.e., 


be meee a 1 f 3 
P (« < (== <b} ~ —— | exp(—x°/2)dx, asn—> %. 
y/n ) = ) VZT Ja 


This means that 


byn J 1 b 2d 
| (fx x f)(x) re =f exp(—x~ /2)dx, as n>, 


yn n times 


Proof. Using Exercise 1.2.22 and Theorem 1.2.6, one sees that the Fourier trans- 
form of the asymptotic relation in the central limit theorem is 


n 

A S 22 

s) we Ts as n> œ. 
n 


Recall that f(s) = f e~?**f(x)dx. So the hypotheses of the theorem say that 


d, 7 
Fat (0) = -4r 


f(0)=1, 70) =0. 


It follows, using the Taylor formula, that since Ô has a continuous second derivative, 


PE 4 ONO), Oa: tee 
jg) ge eg Pl 


Therefore f(s/ yn)” ~ eR as n — o, 

To go from the Fourier transform of the central limit theorem back to the result we 
want, one must know a “continuity theorem for characteristic functions due to Lévy 
and Cramér (see Cramér [115, pp. 97—98], and Loève [425]). This is analogous to a 
Tauberian theorem (see Theorem 1.2.5). Instead we choose to follow the discussion 
in Dym and McKean [147, p. 116]. 

We have proved that 


lim P, 4.4%,)/yn2%8) = lim Í falx) exp(2mixs//n)dx 


n—-eo 


= exp[—27s"], 
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where 


n(x) = (Fr fE) 


if the number of f's in the convolution is n. 
Let dn be the density for the random variable (X; + ---+Xn)/./n. Then we have 
for Schwartz functions k: 


lim | dy(x)k(x)dx = lim | dy(s)k(s)ds 
R 


n—oo n>% JR 
= l exp(—272s?)k(s)ds = Í (27) !/2exp(—x?/2)k(x)dx, 
R R 


using the Plancherel theorem, dominated convergence, and part (1) of 
Theorem 1.2.2. The proof is finished using the following Exercise. 


Exercise 1.2.26. Complete the proof of the central limit theorem by approximating 


Xas) = 4 x¢ (a,b) 


1, x € [a,b] 


by Schwartz functions k. 


It is possible to relax the hypotheses in the central limit theorem and one can also 
obtain error estimates. See the references for the details. 

There is also a multivariate central limit theorem for random vectors in R”. This 
can be quickly deduced from the case m = 1 (see Anderson [7, pp. 74-75]). 

In this section, we have seen the Gaussian or normal distribution in two 
seemingly different contexts—as the fundamental solution of the heat equation (see 
Example 1.2.1) and as the limiting density function for the normalized sum of a 
sequence of independent identically distributed random variables. The Gaussian 
density also appears in a third context closely related to the previous two; namely, 
in Einstein’s work on Brownian motion. The latter is the motion—visible with a 
microscope—of tiny particles suspended in a liquid. Such motion had been observed 
by Robert Brown in 1827, as well as many others before him. Einstein proved in 
1905 that if S$, denotes the displacement after t minutes of a particle in Brownian 
motion then S; has density function 


(42Dt)~'/? exp(—x?/(4D1t)), 


where D depends in a very explicit way on the temperature and friction coefficient of 
the medium. See the references mentioned above for more information on Brownian 
motion. Another interesting reference is Nelson [493]. 

We will find in Chap.3 that we can generalize the central limit theorem to the 
non-Euclidean case, using the non-Euclidean Fourier transform. The proof will be 
basically the same. 
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1.3 Fourier Series and the Poisson Summation Formula 


Jétais d’abord parvenu à plusieurs de ces équations par des éliminations trés-laborieuses, 
mais j’emploie maintenant une règle beaucoup plus générale et trés-expéditive pour 
résoudre une fonction arbitraire quelconque en série de sinus ou de cosinus d’arcs multiples. 
Ces résultats confirment pleinement I’ opinion de Daniel Bernoulli. 

Les développements dont il s’agit ont donc cela de commun avec les équations aux 
différences partielles qu’ils peuvent exprimer les propriétés des fonctions entièrement arbi- 
traires et discontinues; c’est pour cela qu’ils se présentent naturellement pour l'intégration 
de ces dernières équations, et leur application offre des facilitiés singulières dans les 
questions des lignes ||, mouvements des fluides, de la propagation du son, des vibrations 
des corps élastiques, et donne un moyen aisé de déterminer les mouvements avec toute 
la généralité que lon obtiendrait de l’emploi des fonctions arbitraires. J’en ai fait une 
application plus particulière à la question de la propagation de la chaleur et l’on parvient 
ainsi à reconnaitre distinctement comment elle se propage par ondes successives dans 
l'intérieur des corps.” 

—From Fourier’s 1805 draft of his work on the theory of heat propagation in Grattan- 
Guinness and Ravetz [230, pp. 185-186]. 


Als Fourier in einer seiner ersten Arbeiten über die Wärme, welche er der französischen 
Akademie vorlegte, (21. Dec. 1807) zuerst den Satz aussprach, dass eine ganz willkürlich 
(graphisch) gegebene Function sich durch eine trigonometrische Reihe ausdriicken lasse, 
war diese Behauptung dem greisen Lagrange so unerwartet, dass er ihr auf das Entschieden- 
ste entgegentrat. Es soll sich hierüber noch ein Schriftstück im Archiv der Pariser Akademie 
befinden. Dessenungeachtet verweist Poisson tiberall, wo er sich der trigonometrischen 
Reihen zur Darstellung willkiirliche Functionen bedient, auf eine Stelle in Lagrange’s 
Arbeiten über die schwingenden Saiten, wo sich diese Darstellungsweise finden soll. Um 
diese Behauptung, die sich nur aus der bekannten Rivalitaét zwischen Fourier und Poisson 
erklären lässt, zu widerlegen, sehen wir uns genothigt, noch einmal auf die Abhandlung 
Lagrange’s zuriickzukommen; denn tiber jenen Vorgang in der Akademie findet sich nichts 
veröffentlicht. 

... [Here Riemann discusses a formula in Lagrange’s work on the vibrating string]. 

... Hatte Lagrange in dieser Formel n unendlich gross werden lassen, so ware er 
allerdings zu dem Fourier’schen Resultat gelangt. Wenn man aber seine Abhandlung 
durchliest, so sieht man, dass er weit davon entfernt ist zu glauben, eine ganz wirkiirliche 
Function lasse sich wirklich durch eine unendliche Sinusreihe darstellen.* 

—From Riemann [542, pp. 232-233]. 


? At first I arrived at several of these equations by very laborious eliminations, but I now use a much 
more general and expeditious rule to resolve any arbitrary function in a series of sines or cosines 
of [integral] multiples of angles. These results fully confirm the opinion of Daniel Bernoulli. 

As with partial differential equations, the expansions in question can express the properties 
of entirely arbitrary and discontinuous functions. It is for that reason that they naturally present 
themselves for the integration of these last equations and their application offers singular facility 
for questions of parallel lines, fluid motion, sound propagation, vibration of elastic bodies, and 
gives an easy method for determining the motion in all the generality that can be obtained through 
the use of arbitrary functions. I have made a more particular application to the question of the 
propagation of heat and one thus succeeds in recognizing distinctly how it propagates by successive 
waves in the interior of a body. 


3Fourier, in his first papers on heat submitted to the French Academy (December 21, 1807) 
was the first to formulate the theorem that an arbitrarily (graphically) given function could be 
represented by a trigonometric series. This statement was so unexpected to old Lagrange that 
he refuted it vehemently. It is claimed that the Archive of the Paris Academy contains such 
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1.3.1 Fourier Series 


Historically the study of Fourier series of periodic functions preceded the study of 
Fourier integrals. We have reversed the order because it reverses itself naturally 
in later chapters of this book. Electrical engineers have sometimes argued that 
one should look solely at Fourier integrals, since nothing in nature is really 
periodic (see Bracewell [61, Introduction]). However, Fourier series are of great 
importance, both in pure and applied mathematics. They are so essential that people 
have designed machines to compute the coefficients of many specialized sorts of 
functions. In 1903, for example, Michelson had a machine which would find the 
Fourier coefficients of a sound using vertical rods, each vibrating at a different 
frequency. Today people use computers and the fast Fourier transform to analyze 
all sorts of phenomena, as we shall see later in this section. At this point, let us just 
quote an article on weather prediction that appeared in Science (Vol. 220 (1983), 
p. 40): “Fourier transform methods solved that problem [of making the computer 
time necessary for the calculations affordable] in the early 1970s, allowing [weather] 
forecasters to transform the grid mesh into a wave representation, perform the 
nonlinear calculations, and transform the results back to a grid with no substantial 
penalty in computing time.” 

Suppose that f : R” /Z” — C. This means that f has period one in each variable. 
We can also think of f as a function on the unit hypercube (0, 1]” in R”, with 
opposite sides identified. We say that the unit hypercube is a fundamental domain 
for R” /Z”. We can also think of R/Z as a circle and R”/Z’” as a product of m 
circles, or a torus. In the case m = 2, we get a doughnut. 


Definition 1.3.1. The Fourier series of f : R” /Z” — C is 


È (f,ea)ea(x), 


acg™ 


where ea(x) = exp(27i‘ax), for x € R”, a € Z”, and the Fourier coefficients are 


(frea) = J f(v)ea(y)dy. 


(0,1]" 


Note that the functions ea(x), fora € Z”, are eigenfunctions of the Laplacian 


a document. Nevertheless, whenever Poisson uses trigonometric series for the representation of 
arbitrary functions he refers to a passage in Lagrange’s papers on the vibrating string where such a 
representation is supposed to be found. In order to refute this statement which can only be explained 
by the well-known rivalry between Fourier and Poisson we have to come back to Lagrange’s paper 
because nothing has been published about Lagrange’s opposition in the Academy. 

If Lagrange had considered these formulas for infinite n he would have obtained Fourier’s 
result. However, when one goes through his paper one sees that the notion of representing an 
arbitrary function by an infinite trigonometric series is very foreign to Lagrange. 
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a? a? 
A= a Oe 


which have period one in each variable. The first two statements of the theorem 
below say that nice functions f are represented by their Fourier series. This can 
be interpreted as the spectral resolution of A on R”/Z. This parallels the view of 
Fourier integrals given by formulas (1.4) and (1.5) of Sect. 1.2 . It is this point of 
view that we will seek to generalize in Chap.3, where we replace Z” by discrete 
groups of 2 x 2 integer matrices. 


Theorem 1.3.1 (Properties of Fourier Series). 


(1) Suppose that f is in L?(R"/Z"). Then f is the L?- limit of the partial sums of 
its Fourier series. Moreover, we have the Parseval identity: 


È ea)? = IL ld. 


aezm 


(2) Suppose that f € L'(R™/Z™) and that 


È I(fea)| <. 


aez™ 


Then you can change f on a set of measure 0 to make f continuous on R” /Z™ 
and such that 


f(x) = DY (frea)ea(s). 


aezm 


(3) Suppose that f : R” /Z” — C has continuous partial derivatives of all orders 
less than or equal to k. If k > m/2, then the Fourier series of f converges 
uniformly and absolutely to f. 

(4) Riemann-Lebesgue Lemma: Suppose that f € L'(IR"/Z!). Then (f,ea) 
approaches zero, as ||a\| approaches infinity. 


Proof. (1) For example, the Stone—Weierstrass theorem (see Lang [386, p. 148]) 
says that the exponentials e,(x), a € Z”, form a complete orthonormal set in the 
Hilbert space Z? (R” /Z™”). You can also deduce this easily from Exercise 1.1.3 
of Sect. 1.1. The machinery of Hilbert spaces then completes the proof easily. 
The difference of f(x) and the sum of its Fourier series is a function with zero 
Fourier coefficients and is therefore orthogonal to all trigonometric polynomials 
and thus to all continuous functions. Since continuous functions are dense in 
L!, the difference of f and its Fourier series must be zero except on a set of 
measure zero. 

(3) If we use the notation of formula (1.3) from Sect. 1.2, we have 


(D* fen) = (2nib)“(f, en). 


(2 


Na 
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Then D*f € L? (R” /Z") implies that 


XY Y |(f,es)/7[(2ab)4P < 


lal=k bez” 


Now there is a constant c > 0 such that 


> [(2nb)*? > clib. 


|a|=k 


So the Cauchy—Schwarz inequality enables us to compare the series of absolute 
values of Fourier coefficients and the series ||b||~?, which is an Epstein 
zeta function. To see that the series X, ||b||7?* converges for k > n/2 you can 
proceed by developing a higher-dimensional version of the integral test. See 
Exercise 1.4.5 of Sect. 1.4. Or use Theorem 1.3.5 of this section. 

This is clear for a dense set of L! functions; e.g., functions in L! AL, or 
infinitely differentiable functions. 


(4 


w 


It follows from part (3) of Theorem 1.3.1 that the speed of convergence of 
the Fourier series increases with the smoothness of the function. This sort of 
theorem goes back to Dirichlet in 1829 (see Grattan-Guinness and Ravetz [230, 
pp. 471—474]). Fourier had not quite managed to show that any piecewise-smooth 
function can be represented by its Fourier series, taking the average value at jump 
discontinuities, since we can, in fact, allow the function f(x) to have a jump 
discontinuity at x, setting 

FO) Most ON) and FO) Baat 
The convergence of Fourier series of piecewise-smooth functions is discussed in the 
next exercise. 


Exercise 1.3.1 (Jump Discontinuities and Fourier Series). 


(a) Suppose that f(x) is a piecewise-continuous function on [0,1). Assume that 
both one-sided derivatives of f exist at x in [0,1). Define the Dirichlet kernel 


D,,(x) by 
sin[z(2n + 1)x] l 


D,(x) = > exp(2rikx) = ao 


|k|<n 


Then the nth partial sum of the Fourier series of f is 


(Su) 0) = E (Hreadeats) = (DA = | FODE- 


|k|<n 


Show that (f x D„)(x) approaches 5(f(x+) + f(x—)) as n approaches 
infinity. 
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(b) Generalize the results of part (a) to functions of several variables. Assume that 
the function has sufficiently many continuous derivatives. Use the Dirichlet 
kernel: 


Then f*D, is the partial sum of the Fourier series of f corresponding to the 
sum over b € Z” with |b;| < aj,j =1, ..., m. Show that Da * f approaches f 
as all the a; approach infinity, assuming that f(x) is sufficiently smooth. 


Although we can allow the function f : R/Z — C to have jump discontinuities, 
the partial sums of the Fourier series of such functions will always overshoot the 
mark by about 9 % at the jumps. This is called the Gibbs phenomenon because 
it was pointed out by Gibbs in a letter to Nature in 1899. The letter was a reply 
to Michelson, who was angry that his machine for computing Fourier series failed 
badly at jumps. Actually the phenomenon was first observed by Wilbraham in 1848. 
There are many methods for improving the convergence of Fourier series and these 
are quite important for filter design (see Hamming [247]). The Gibbs phenomenon 
also occurs for Fourier integrals. See Exercise 2.1.15 of Sect. 2.1.2 for some work 
on an analogue of the Gibbs phenomenon in a higher-dimensional setting. 


Exercise 1.3.2 (The Gibbs Phenomenon and Smoothing in the One-Variable 
Case). 


(a) Set 


Show that for large n and small |x|, the partial sums S, f(x) of the Fourier series 
of f, defined in Exercise 1.3.1, will overshoot the mark by about 9 %. This 
is the Gibbs phenomenon. Either graph the result by computer or follow the 
theoretical approach in Dym and McKean [147, p. 43]. 

(b) Show that it is possible to smooth out the horns of the Gibbs phenomenon in 
part (a) by averaging the first n partial sums (Cesaro summation). Show that, 
in fact, 


n 


(K.*f) (x)= —— Esra, 


n+1 5 


with K, = the Fejér kernel in Example 1.1.1 of Sect. 1.1. Graph the result by 
computer showing that Fejér smoothing eliminates the oscillations of the partial 
sums but it has a slow “rise time.” 

(c) Graph the result of the more popular smoothing method called Lanczos 
smoothing, which is defined by 
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Fig. 1.3 Comparison of various smoothing techniques. (From R.W. Hamming [247, p. 74]. 
Reprinted by permission of Prentice-Hall, Inc., Englewood Cliffs, NJ) 


n ptta/n 
Inf (2) = ais Spf (u)du. 


20 n/n 


See the Matlab website for information about smoothing and filters or see 
Hamming [247]. Your answers should resemble the graph of Fig. 1.3. 


Carleson [75] proved that the Fourier series of any Z? function on R/Z converges 
almost everywhere. For R? / Z?, the work of C. Fefferman and others showed that 
the answers to the convergence question depend on the type of partial sums used; 
e.g., sums over squares, rectangles, circles (see J. M. Ash [15]). In fact, Ash notes: 
“At the present time, the passage from 2 to 3 dimensions seems far more substantial 
and nontrivial than that from 1 to 2 or than that from 3 to more.” 

There are examples of continuous functions of one variable which have Fourier 
series that diverge at uncountably many points. And there is an L! function with 
a Fourier series that diverges everywhere (see Kolmogorov [364]). References for 
such results can be found in the collection of articles edited by J. M. Ash mentioned 
in the preceding paragraph. See also the work of Zygmund [757]. For some of the 
older history of Fourier series, see Burkhardt [72], Grattan-Guinness and Ravetz 
[230], Hilb and Riesz [296], and Riemann [542, pp. 227-271]. Theorem 1.3.4 will 
show, however, that if all you seek is distributional convergence, then you always 
get what you want. 

The following exercise is discussed more fully in the last section of this chapter. 
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Exercise 1.3.3 (Weyl’s Ergodic Theorem or Weyl’s Criterion for Uniform 
Distribution). For any irrational number a and any continuous function f(x) of 
period one, show that 


ee l 
lim 2 f (na) =} f(x)dx. 


N= 
Hint. First prove it for trigonometric polynomials. 


Exercise 1.3.4 (The Shannon Sampling Theorem). This result was proved by 
Shannon in [587]. A function for which the Fourier transform is zero outside 
of some interval is called band limited. We seek to reconstruct such a function 
from samples taken at equally spaced points. Suppose that f : R — C and f (y) =0 
for all y with |y| > ye. Show that 


f= dF 


neZ 


pagn 


Hint. First expand f(y) in a Fourier series over the interval |—ye, +ye]: 


A miny 
f(y) = ¥ Anexp (= ) for y € [-ye,+Yc]- 
E 


neZ 


Let py. (x) be 1 if x € [—yc, +yc] and 0 otherwise. Then, for any y € R, 


f(y) = Pr-O) È, Anexp (=> ) , 


neZ Ç 


Use the Fourier inversion theorem, the translation property of the Fourier transform, 
and the formula from Exercise 1.2.6 of Sect. 1.2 giving the Fourier transform of py., 
and you will find the solution of the problem. 


1.3.2 Vibrating Drums 


A vibrating drum D is modelled by the wave equation Au = uy in the interior 
of D. Assume, for example, that the drum is tied down on the boundary (i.e., u 
vanishes on the boundary, the Dirichlet boundary condition) and that the initial 
condition says u(x, y,0) = f(x,y) and u;(x,y,0) = 0. Separation of variables allows 
you to write the solution as a series involving the eigenfunctions of the Laplacian 
on the region. For the unit square, the eigenfunctions of the Laplacian have the form 
Vm n(x, y) = sin(znx) sin(zmy), n,m = 1,2,3, .... The corresponding eigenvalues are 
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Fig. 1.4 On the left is sin(10x)sin(6zy) and on the right sin(10zx)sin(6zy) — 
s/ 2/3 sin(6zx) sin(10zy) 


=n? (n? +m’) . These give the fundamental frequencies of vibration. When you 
solve the wave equation for the initial conditions just stated, you get a Fourier series: 


u(x,y,t) = Ý, cnmsin(snx) sin(zmy) cos (x vn? + m?) ; 


nym>1 


where f(x,y) = £ Cnm Sin(nnx)sin(zmy). 
n,m>1 


Thus when you hear a drum you are hearing the eigenvalues of A. This led to a 
famous question of Kac and Bochner: “Can you hear the shape of a drum?” It was 
asked by M. Kac in a famous paper [332]. Mathematically the question becomes: 
Suppose 2 drums have the same eigenvalues of A, does it follow that then you can 
obtain one drum from the other by rotation and translation? Kac says in his paper 
that the question was first asked by Bochner. In his autobiography [333, p. 144], he 
notes that Lorenz conjectured that the area of the drum could be determined by the 
eigenvalues of A (actually for the analogous situation in three dimensions). Weyl 
proved the Lorenz conjecture in less than 2 years. In 1936 Pleijel showed that the 
length of the boundary of the drum could be determined from the spectrum of A. 
We will have more to say about hearing the shape of the drum in Chap. 3. 

In Fig. 1.4, we imitate Fig. 3 on p. 302 of Courant and Hilbert [111]. Courant and 
Hilbert graphed the nodal lines of eigenfunctions of A on a square. These are graphs 
of the curves given by setting the eigenfunctions or linear combinations of several 
of them equal to 0. Such graphs can be seen in real life by placing sand on a drum. 
Instead we graph density plots for such functions. The graph on the left is that of 
sin(10z-x) sin(6zy) obtained using the Mathematica command: 
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DensityPlot[(Sin[10*Pi*x]*Sin[6*Pi*y]),{x,0,1},{y,0,1}, 
PlotPoints->100,ColorFunction->Hue,Mesh->False,Axes->False] 


The graph on the right is that of sin(10zx) sin(6zy) — \/2/3sin(6z-x) sin(10zy), 
obtained using a similar Mathematica command. 


Exercise 1.3.5. Solve the wave equation on the unit square or unit disc with initial 
condition modelling of what would happen if the drum were “hit” at a point. 
Then make a Mathematica movie of your solution. See Courant and Hilbert [111, 
pp. 300-306]. For the unit disc, one should switch to polar coordinates. The solution 
involves J-Bessel functions which we discuss in Sect. 2.2. See also Powers [520, 
pp. 262-269], a book with a cover showing photographs of vibrating kettledrums 
with dark dust on them—pictures that compare well with the computed nodal lines 
on p. 269 of the book (which are circles centered at the origin plus rays emanating 
from the origin). I reproduced the cover of that book in [668, p. 88]. The photographs 
were themselves taken from a Scientific American article by Thomas D. Rossing on 
the physics of kettledrums, from November, 1982. 


Exercise 1.3.6. If you add a forcing term to the wave equation and solve the 
problem of forced vibration of a membrane, the PDE becomes: uy, — Au = 
f(x,y) cos(@rt). Assume also that u = 0 on the boundary of the membrane and 
that the membrane is at rest with 0 velocity at time 0; i.e., u(x,y,0) = u;(x,y,0) = 0. 
Then write the solution to the problem as 


u(x,y,t) 2D Cn m(t)sin(nnx)sin(zmy). 
n,m>1 
Show that then if vn m(x, y) = sin(anx) sin(amy) and Anm = — n° (n? +m), then 


Cnm(t) = Paar (cost — cos (i / Pant) ) : 


Here 


(f, Vn) = I FRY)Vn ym (x,y) dxdy. 


x,y€ [0,1] 


What happens to the solution if œ approaches +,/]Anm|? This behavior is called 
“resonance.” It leads to a blow-up of the membrane over time. 


There are many famous bridge failures due at least partially to resonance. One 
was Angers Bridge, in Angers, France, 16 April 1850. The collapse was due to 
resonance from marching soldiers. A similar story holds for the collapse of the 
Broughton suspension bridge near Manchester, England in 1831. For this reason, 
soldiers are told not to march in step when going across bridges. 
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The Tacoma Narrows Bridge self-destructed quite spectacularly in 1940. There 
are amazing movies of the destruction on the web. Some texts explain the disaster 
by resonance. Recently that has been disputed. Nevertheless eigenvalues are still 
important quantities to know in determining the stability of such structures. 


M. Braun [64, pp. 173-175] notes: “There were many humorous and ironic incidents asso- 
ciated with the collapse of the Tacoma Bridge. When the bridge began heaving violently, the 
authorities notified Professor F. B. Farquharson of the University of Washington. Professor 
Farquharson had conducted numerous tests on a simulated model of the bridge and had 
assured everyone of its stability. The professor was the last man on the bridge. Even when 
the span was tilting more than twenty-eight feet up and down, he was making scientific 
observations with little or no anticipation of the imminent collapse of the bridge. ” 

“A large sign near the bridge approach advertised a local bank with the slogan ‘as safe 
as the Tacoma Bridge.” 

“After the collapse of the Tacoma Bridge, the governor of the state of Washington made 
an emotional speech in which he declared ‘We are going to build the exact same bridge, 
exactly as before.’ Upon hearing this, the noted engineer Von Karman sent a telegram to 
the governor stating ‘If you build the exact same bridge exactly as before, it will fall into 
the exact same river exactly as before.” 


The phenomenon of resonance is quite general. It works for all sorts of vibrating 
objects, even with nonconstant density and tension, and even in higher dimensions. 
Of course it can be used for good as well as evil. Consider musical instruments, for 
example. Can they be played for evil? I suppose you might be able to destroy a glass 
by making it resonate. 

Closer to modelling an actual bridge than a vibrating rectangular membrane is a 
vibrating rectangular plate with free edges. This changes the PDE to Uxxxx + 2Uxxyy + 
Uyyyy = Ut. See the article of Gander and Kwok [190] comparing eigenmodes 
for this fourth-order PDE and the pictures of the vibrating Tacoma Narrows 
bridge. The article also discusses the Chladni figures which are the nodal lines for 
eigenvibrations of a square plate. 


1.3.3 The Poisson Summation Formula 


Now we want to investigate the connection between Fourier series and Fourier 
integrals. This is the Poisson summation formula. Suppose that f : R” — C is a 
Schwartz function. Form the periodic function (of period one in each variable), 


g(x) = >) fata). (1.7) 


aez™ 


Exercise 1.3.7. Show that the series (1.7) converges uniformly for all x € R”, 
assuming that f is a Schwartz function. 


When m = | and the support of f lies inside (0,1), the graphs of f and g may 
resemble those in Fig. 1.5. If the support of f is larger, the graph of g will be more 
complicated. 
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Fig. 1.5 Periodization of a bump function 


Theorem 1.3.2 (The Poisson Summation Formula). Jf f : R” — C is a Schwartz 
Junction, then 


= J f(x+a) = BA a) exp(2mi‘ax). 


aez™ ez 


Proof. The Poisson summation formula simply says that the periodic function g is 
represented by its Fourier series. For f (a) is really the ath Fourier coefficient of g, 
by the following calculation for a € Z”: 


fla) = f.,, £0) exp(—2ni ‘ay)dy 
=| £ f(y +b)exp [—2zi'a(y +b)] dy 
[0,1]” pez” 


= g(y)exp(—2mi ‘ay)dy. 
(0,1) 


Thus Theorem 1.3.1 implies Theorem 1.3.2. 


In fact, Poisson is not the only name attached to Theorem 1.3.2, for Gauss and 
Cauchy also found the formula (see Burkhardt [72, p. 1338]). However, Jacobi, who 
needed the result in his work on theta functions, attributes it to Poisson (see Jacobi 
[322, p. 307]). 

The hypotheses on the function f(x) in Theorem 1.3.2 should be weakened (see 
Stein and Weiss [636, pp. 250—257]). There is also an interesting discussion of the 
Poisson summation formula in Feller [177, Vol. II, p. 632]. Exercise 1.3.22 is a 
cautionary example on the failure of Poisson summation. See Hejhal [265] for a 
discussion of a related summation formula due to Voronoi. 

It would be useful to be able to derive these results on Fourier series directly from 
the inversion formula for the Fourier transform in Sect. 1.1. Bracewell ([61, pp. 204 
ff.]) claims to do this, but there seems to be a gap in his argument. Exercise 1.2.20 of 
Sect. 1.2 gives a way of deriving the representation of certain functions by Fourier 
series, using the inversion of Laplace transforms plus Cauchy’s integral theorem 
(see also Titchmarsh [678, pp. 4-6]). There is also a short distribution-theoretic 
proof of the Poisson sum formula which we will describe after Theorem 1.3.4. We 
mention this because it would be nice to be able to create similar arguments in 
later chapters. There is another distribution-theoretic discussion of the formula in 
Lighthill [421, pp. 67-68]. 


1.3 Fourier Series and the Poisson Summation Formula 41 


On the other hand, it is possible to derive Fourier integrals from Fourier series of 
functions of period P, by letting P approach infinity. This is done by Titchmarsh in 
[678, pp. 70-73], for example. To sketch the formal argument, suppose that f(x) is 
a Schwartz function on R. Let 


fe(x) = È f(x tnP). 


neZ 
The Fourier series of this function fp of period P is easily seen to be 


1 P/2 
£6= 5% ( jof'0)exp(-2riky/P)ay) exp(2nikx/P) 


1 +P/2 


~ De f(y) exp[2mi(x— y)k/P}dy. 


Then argue that, as P approaches infinity, this last sum approaches the Riemann sum 
for 


I fO)exp[2ri(x— y)wļdydw. 
wER JyeR 


A simpler way of comparing Fourier series with Fourier integrals comes from 
comparing the kernels that result from approximating the Fourier series by its nth 
partial sum in Exercise 1.3.1 and approximating the Fourier integral by that on a 
finite interval [—r,r], as in Exercise 1.2.7 of Sect. 1.2. These kernels are 


_ sin[2(2n + 1)x/P] 


DE 
n) = pin (ex/P) 
and 
sin(27rx) 
W,(x) == a 
Now 


__ sin(m2nx/P)cos(mx/P)  cos(m2nx/P)sin(mx/P) 


Dy(x) = Psin(nx/P) , Psin(ax/P) 


which approaches W,, p(x) as P approaches infinity (for fixed x). 

Next consider Fourier series of periodic distributions. The space of test functions 
D consists of all infinitely differentiable functions on R” /Z’". Here we should think 
of R” /Z” as a product of circles (i.e., as a torus which is a doughnut when m = 2). 
Convergence of a sequence of test functions means uniform convergence of each 
sequence of derivatives. A distribution T on R” /Z” is a continuous linear function 
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Fig. 1.6 The Dirac comb or Shah functional 


T : Ď > C. It is possible to identify T with a periodic distribution T on R” with 
T, =T for all a in Z”, using the definition of T, in Theorem 1.1.1, Part (g) of 
Sect. 1.1. To see this, suppose that g is a test function on R” and set g = the Z”- 
periodization of g: 


ax) = È sata). 


aezm 


Then if T is a distribution on R” /Z”, define the periodic distribution T of period 1 
on R” by 


T(8) =T(g). 


For example, if ô is the Dirac delta distribution then 


ô= > ô (1.8) 


nezm 


Some engineers call the distribution õ on R” the shah functional, for the 
Russian letter LU. Note that if fand g are related by formula (1.7), then ô f=Uf=g. 
One can think of ô as an infinite “Dirac comb” of impulses pictured in Fig. 1.6 
(see the article of Sakai in Vanasse [689, p. 7]). The Fourier transform of 6, is the 
function exp(27i ‘ax). Thus the Fourier transform of the shah functional is 


U(x) = X, exp(—2zi'ax). (1.9) 


aez™ 


The Poisson summation formula is exactly the statement that the shah functional is 
its own Fourier transform; i.e., that LU = LL. 


Theorem 1.3.3 (A Criterion for Distributional Convergence of a Trigonometric 
Series). A trigonometric series 


£ cnexp(2ri'nx) 


nezn 


converges in the distributional sense (see Sect. 1.1) if there are positive constants 
C,a such that 


Icn| <C|ln|\*, forall nez”. 
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Proof. Set 


f(x) = $, |[2min||~**cn exp(27i ‘nx). 
nEZn—(O 


This series converges uniformly to a continuous function f(x) provided that k is 
sufficiently large (by the proof of Theorem 1.3.1). Now apply A‘, where A is the 
Laplace operator, to obtain Lc, exp(27i 'nx), which must therefore converge as a 
distribution. 


Bochner [44] notes that Riemann had already considered this sort of convergence 
long before Schwartz’ book [566]. 


Definition 1.3.2. Let T be a distribution on R”/Z”. The Fourier series of T is 
den(T) exp(27i ‘nx), with ca(T) = (T(x), exp(—27i'nx)). 

Now we can lay to rest all worries about convergence of Fourier series in the 
sense of distributions. 


Theorem 1.3.4. The Fourier series of any distribution T on R” /Z” converges toT. 


Proof. First note that the nth Fourier coefficient of 5 on R”/Z™ is 1, since 
ô(en)=1. And the Fourier series of 6 converges to 6 by Theorem 1.3.1 or 
Theorem 1.3.2. For this just says that the Fourier series of any test function 
converges to the function when everything is evaluated at x = 0. 

Now let T be any distribution on R” /Z”. Then 


nezm nezm 


T =T*6 =Tx ( x e) = ¥ (Ten). 
Here all convolutions are over R” / Z”. Finally, 


(Ten) (x) = (T(y),exp(2mi'n(x—y))) = exp(2ri 'nx)(T (y) ,exp(—2ri'ny)). 


This completes the proof of Theorem 1.3.4. 


Now let us discuss a distribution-theoretic proof of the Poisson sum formula for 
R/Z from Donoghue [137, p. 162] (cf. also Friedlander [186, p. 104]). Let T be a 
periodic tempered distribution. Then, since Tą = T for all a € Z, we have 


(erias _ 1)T(s) —(, 


Thus Î(s) must be supported on the zeros of e?”@* — 1; i.e., on Z. Then, using the 
first general fact about distributions that was stated in Sect. 1.1, it follows that T 
must be a sum over a € Z of linear combinations of derivatives of 6,. Now suppose 
that T = L, the shah functional. Then the fact that shah is its own Fourier transform 
follows fairly easily. 
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Next we consider some applications of Poisson’s summation formula. The first 
exercise can also be viewed as a example of the method of images (see Sommerfeld 
[610, pp. 72-74], and Exercise 1.3.23). We will see many applications of this result 
throughout this book. 


Exercise 1.3.8 (Jacobi’s Transformation Formula for the Theta Function). 
Show that for x € R” and t > 0 the following equation is a consequence of 
Theorem 1.3.2: 


>” exp [—2mi'ax—4n7cllal|?t] = (4act)-"/? ¥ exp [= la +x? /(4ct)|. 


aez™ aczZm 


In the special case m = 1, x = 0, the theta function of Exercise 1.3.8 is the 
partition function for the planar rigid rotator in quantum-statistical mechanics. And 
it approaches the high-temperature limit as t — 0+, a limit that can be computed 
from the transformation formula in Exercise 1.3.8 (see Hurt [308, p. 10]). 


Exercise 1.3.9 (Heat Diffusion on a Circle). Consider the problem 


qu — (Fy 
ot 
be 


Claim. The solution is the same as that of Example 1.2.1 in Sect. 1.2. The exercise is 
to start out writing the solution as a Fourier series in x. The Fourier coefficients will 
be functions of t satisfying an ordinary differential equation that is easily solved. 
This allows you to write the solution u(x,t) as a convolution over [0,1] of f with the 
theta function in Exercise 1.3.8. Then use that last exercise to see that the solution 
is really the same as that of Example 1.2.1 in Sect. 1.2. 


x€R/Z, t>0, 
fe ), xER/Z. 


Exercise 1.3.10. Derive some more results from the Poisson summation formula, 
such as the following result for t > 0: 


2 2nn\?\ — 
X exp(—|n|t) = ; 5 ( H ( ; ) 
neZ neZ 


Justify the formula, even though exp(—|x|) is not a Schwartz function. 


A more general result of this type involves (for t > 0) 


5 (t+) 


neZ 


when Re(s) > 5: The K-Bessel function will enter into the last formula. And this 
result will appear in Chap. 3 as the Fourier expansion of Epstein’s zeta function. 
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Theorem 1.3.5 (Asymptotics of the Eigenvalues of A on Z? (R” /Z”)). Let N(x) 
be the number of eigenvalues À of A such that |A| < x and Ah = Àh for h #0 and 
h € L? (R” /Z"). The eigenvalues are counted with multiplicity. Then 


N(x) ~ (4n)7"?0(1+m/2)-1x"/? as x > o. 


Proof. Clearly a complete orthonormal set of eigenfunctions of A on R”/Z” 
consists of the exponentials e,(x) = exp(2ri ax), a € Z”. Now 


Aea = 47° |ja||?ea. 
It follows that 
N(x) = Ha € Z” | 4r°llal? < x}. 


When m = 2 this is the number of lattice points inside of a circle of radius y/x/ (27). 
One could give a simple geometrical argument to prove Theorem 1.3.5 (cf. Hardy 
and Wright [251, pp. 270-272] or Courant and Hilbert [111, p. 430]). But we choose 
instead to use the Tauberian theorem which was Theorem 1.2.5 of Sect. 1.2. 

So we look at the Laplace transform of N(x). This Stieltjes integral is really the 
sum 


> exp[—427 lalli] = (4ae)-"/? E exp[—|lal|?/(42)]. 


aezm nezmn 


Here we have used Exercise 1.3.8, which is a simple application of the Poisson 
summation formula. The right side of the last equality is asymptotic to (4rt) 2 
as t — 0+. Thus the Tauberian Theorem (Theorem 1.2.5 of Sect. 1.2) finishes the 
proof. 


Finding the order of magnitude of the error term in the asymptotic expression 
of Theorem 1.3.5 is called the circle problem in number theory (when m = 2). 
It appears to be quite difficult. A history of results on the subject can be found 
in Hejhal [262, p. 450]. Theorem 1.3.5 is a special case of Weyl’s result on the 
asymptotic distribution of the eigenvalues of the Laplacian on a compact domain 
(see Weyl [730, Vol. I, pp. 393-430, Vol. IV, pp. 432—456]). We will consider a 
non-Euclidean analogue of Theorem 1.3.5 in Chap. 3. 

We take the following exercise from Bogomolny [46, p. 10]. 


Exercise 1.3.11 (A Trace Formula for Billiards on a Square). The eigenvalues of 
A for periodic boundary conditions on a square in the plane are 2, = — (27)? ||n||7, 
for n € Z?, since periodic eigenfunctions have the form exp (27i‘nx), for x € R?. 
Periodic or closed geodesics in the square with sides identified (otherwise known 
as the torus R?/Z7) are lines of rational slope m;/m2, with m € Z*. The length 
of such a geodesic is ||m||. Why? If the fraction is not in reduced form and d = 
gcd(m,,m2) > 1, then we view the geodesic as not what we later call “primitive.” 
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It is the dth power C? of a primitive geodesic C; i.e., C traversed d times. Suppose 
that h is a Schwartz function on Rt. Show that 


E (Qr)? Ill?) =a [He xjdx+ = 


neZ 


z f" ho (||| vx) dx 


T nez- 


You can view this as a trace formula with sums over eigenvalues of —A on the 
torus R?/Z? on the left and sums over lengths of periodic or closed geodesics in 
R?/Z? on the right. These periodic geodesics can be viewed as periodic orbits for 
the free motion of a billiard on the square billiard table, with periodic behavior at the 
boundaries. We drew pictures of some periodic geodesics in R?/Z? in Sect. 1.5.4. 
See Figs. 1.23 and 1.24. 


The Selberg trace formula for [\\H, where H is the Poincaré upper half-plane and 
T = SL(2,Z) proved in Sect. 3.7 will look rather similar to the result of the last 
Exercise. In that case, part of the right-hand side of the trace formula can be viewed 
as the sum over lengths of orbits of non-Euclidean billiards. We drew a picture of 
such a billiard in Sect. 3.7. See Fig. 3.43. 

Next we want to consider an example of Fourier analysis in the real world of 
smog, earthquakes, etc. 


1.3.4 Spectroscopy and the Search for Hidden Periodicities 


Spectroscopy was created by Michelson and Morley in 1887 to develop a length 
standard based on the wavelength of an emission line of an element. In 1892 the 
wavelength of the red line of cadmium became the international length standard. 
This lasted until 1960 when the orange line of krypton-86 replaced the red line 
of cadmium. And Michelson was awarded a Nobel prize in 1907 for this work, 
which has had an enormous variety of applications—from the identification of air 
pollutants in Los Angeles smog to the components of the Venusian atmosphere. 
Fourier analysis was always involved in the theory of spectroscopy. And the 
theory of spectroscopy has certainly been one of the motivating factors leading to 
interesting developments in harmonic analysis (see Wiener [737, pp. 119-—260]). 
But Fourier transform spectrometers were not built until the 1970s because they 
require fast computers and the fast Fourier transform of Cooley and Tukey [108] 
(see Sect. 1.5.1 and Brigham [67]). The graph of observations of spectra of Venus 
(Fig. 1.7), from the article of Bell in Vanasse [689, p. 138], shows that Fourier 
transform spectroscopy has greatly improved resolving power (from 8 cm™! in 1962 
to 0.05cem7! in 1973). 

Here we shall only be able to give a brief glimpse into the fascinating theory of 
spectroscopy. The interested reader could consult any of the following references 
for more details: Bracewell [62], Bousquet [59], S. P. Davis, M. C. Abrams, and 


1.3 Fourier Series and the Poisson Summation Formula 47 


1. MAC DONALD OBSERVATORY, 1962 
210 CM TELESCOPE 
GRATING SPECTROMETER 
RESOLUTION 8 CM 


6200 6300 6400 6500 
STEWARD, (90 CM) 1964, FOURIER SPECTROMETER 


cm 


RESOLVING POWER x I0 
6450 6500 6550 cm~ 


Ill SAINT - MICHEL (193 CM) 1966, FOURIER SPECTROMETER 
R x 100 


6505 


IV PALOMAR, (500 CM), 1973 
FOURIER SPECTROMETER 
RESOLUTION 0.015 CM7! 


6502 6503 6504 6505 6506 6507 6508 6509 


Fig. 1.7 Improvements in the near-IR Venus spectrum due to Fourier spectroscopy: same 
type detectors (cooled PbS) with almost the same NEP used throughout. Curve I by Kuiper 
(1962); Curve II from Connes and Connes (1966); Curve III from Larson and Fink (1975); and 
Curve IV from Connes and Michel (1975). Four strong CO2 Venusian bands are shown in I; 
the rotational structure is resolved in II; IN shows lines from much weaker overlapping banks; 
IV gives a good approximation of the true line profile, together with ever fainter line. Trace IV 
presents approximately 1/800th of the actual spectral range available from the magnetic tape— 
general-purpose computer output (parts of which are obscured by H20) (after Connes and Michel, 
1975). (From Bell’s article in Vanasse [689, p. 138]. Reprinted by permission of Academic Press) 
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Fig. 1.8 The electromagnetic spectrum. The wavelength A is given in units of meters; the 
frequency, v, is given in units of hertz (1 Hz = 1 oscillation per second); and the energy, E, carried 
by a mole of photons is given in joules (4.184J = 1 calorie). The wave number V is expressed 
in units of cm! (read “reciprocal centimeters” or “wave numbers”). (From Harris and Bertolucci 
[255, p. 2]. Reprinted by permission of Oxford University Press) 


J. W. Brault [122], Griffiths [232], Harris and Bertolucci [255], Vanasse [689]. Some 
of the background in crystallography will be discussed in Sect. 1.4. There are also 
many connections with time-series analysis as Wiener notes in [737, pp. 119-260]. 
More information on time-series analysis can be found in Bloomfield [42], Jenkins 
and Watts [328], Kanasewich [336], and Mackey [443, Sect. 18], for example. 
Time-series analysis studies phenomena f(t) that are neither periodic nor decaying 
as t approaches infinity. Thus classical Fourier analysis does not apply, although 
the theory of distributions does allow one to treat such functions. For example, 
given the variable star data below from Bloomfield [42], one may ask: What are the 
hidden periodicities? This question goes back to Schuster [563]. We will consider 
this question briefly after discussing spectroscopy. 

Now let us review the basic ideas of spectroscopy. This is the study of the 
interaction of electromagnetic radiation and matter. Quantum mechanics says that 
radiation emitted at frequency v can move an atom from energy level E4 to E2 
according to Planck’s law: E) — E; = hv. Here v = c/Ay, where c =the speed 
of light, Ay = wavelength. Since c is known to less accuracy, spectroscopists use 
o = 1/A, = the wave number, and not v. Figure 1.8 from Harris and Bertolucci 
[255, p. 2], gives an idea of the wave numbers of various sorts of radiation. 

In order to understand what is going on with spectral lines, you really need to 
know some quantum mechanics, chemistry, crystallography, and group representa- 
tions. We will touch on this in Sect. 1.4 and Chap. 2. 

A Fourier spectrometer uses a Michelson interferometer to divide the beam from 
some source into two separate beams of equal strength. After the two beams travel 
different paths, they are recombined. A signal is obtained which ultimately becomes 
the interferogram function 
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F(x) = f Blo)cos2nox)do, 


where B(o) is the source spectral density at wave number o. Then Fourier 
inversion gives the equation of spectral recovery: 


B(o) =4 f ” F(x) cos(2mox)dx. 


We view F(x) as an even function on the real line here. Since all functions actually 
live on finite intervals, we have to look at 


Fr =F -X_r7; 


where %j-r,r) denotes the indicator function of the set [—T,7]; i.e., 


@afh it st 
et a a eee 


Then the real source spectral density (using the convolution theorem for the Fourier 
transform and Exercise 1.2.12 of Sect. 1.2) is 


Bx(o) = B(o)*(sin(2270X)/m0). 


So spectroscopists call sin(270X)/ao the instrument function. To avoid the 
Gibbs phenomenon, one may modify the instrument function using methods 
analogous to those mentioned in Exercise 1.3.2. The spectroscopy literature usually 
does this by replacing Xx xj(t) by %j-x,x(t)(1 — |t|/X). This replaces By by 
Bx(sin(w#0X)/20X)?. Physically this “apodization” can be obtained by putting on 
suitable apertures. 

In fact, one can only sample F to reconstruct B and thus Shannon sampling 
(Exercise 1.3.4) enters the picture. Suppose that the samples of F are taken at 
nd, n=0,1,2,3, .... Then if B(o) = F(o) vanishes for |o| > Om, we need d = 
1/20, from the sampling theorem. One says that a sampling distance d > 1/20, 
makes the spectrum free of aliasing. 

Periodic sampling errors can lead to“satellites” near strong spectral lines, as 
in Fig. 1.9. Such errors come from defects in the equipment. Let us consider a 
theoretical explanation. 

Figure 1.9. means B(o) = 2Cd(o — co), with C a constant. Then F(x) = 
Ccos(2m00x). Suppose that F is sampled with periodic errors. Then x would be 
replaced by x’ with x’ = x + B cos(27ex). If B is small then 


F(x’) = C {cos(2m0x) — xB op sin[27 (op + €)x] — TB op sin[27(oo — €)x]} . 


This gives two satellites as pictured in Fig. 1.9. 
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Fig. 1.9 Satellites caused by sampling errors 


Finally, let us make a few connections with time-series analysis. The remarks 
go back to Wiener [737, pp. 119-260]. If E(t) is the electric field and 6 the path 
difference of the interfering beams in the spectrometer, then the intensity or mean 
power is, for T = 6/c, 


ag(t) = lim or |, EME oar 


This expression is called the autocovariance or autocorrelation of E and plays 
a large role in time-series analysis. Strictly speaking, it should be normalized to 
a(t)/a(O). It is the Fourier transform of the power spectrum by the Wiener- 
Khintchine formula. To see this formally, suppose ¥/_7,7) is the indicator function 
of [—T,T] as above. Set Er = E- %j-r,r}. Then (under appropriate hypotheses on £) 


1 ptr 
ag(T) = jim = r E(t)E(t +1) dt 
1 HT He, 
= lim — Er(t) Er(w)exp[2riw(t + T)] dw dt. 
T=» 2T J-T es 
i 1 ake ; A 2 
= jim z7 - exp(2ziwT)|Er(w)|~ dw. 


This is the desired formula since the power spectrum is 
P(w) = lim |Er(w)|?/2T. 
T— œ 


But what does all this have to do with the hidden periodicities in the variable 
star data of Fig. 1.10 from Bloomfield [42, p. 3]? This is explained quite well 
by Bloomfield [42, Chap.2]. The most natural method to try in searching for 
periodicity is to model the 600 data values x,, for t = 0,1, ...,599 by the function 
(u +Acos(@t)+Bsin(@t)) or by a sum of such terms. You could, for example, 
guess œ, and then choose u,A,B to minimize 


599 
S(u,A,B) = ¥ (x — u — Acos (œt) — Bsin(ar))’. 
t=0 
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Fig. 1.10 Magnitude of a variable star at midnight on 600 successive nights. (From Bloomfield 
[42, p. 3]. Reprinted by permission of John Wiley & Sons) 


This is the method of least squares. If you count the 21 peaks in the data, you could 
guess the period to be 600/21 = 28.6. This leads you to take @ = 27 /28.6 = 0.220. 
The method of least squares then gives, if you ignore certain terms, 


1 
u = y= (tot +n), n=599, 


A(@) = ZE —¥) cos (ot), 
B(@) = “i -¥) sin (or). 


So we are approximately looking for the discrete Fourier transform. 

In order to obtain further information on the periods, one forms the periodogram 
n(A? + B?)/2. The graph of this function is shown in Fig. 1.11 ( from Bloomfield 
[42, p. 19]). The maximum occurs at œ = 0.21644. If one then does a least 
squares analysis using this value of œ, one obtains the graph of Fig.1.12 for 
x — (U +A cos(@t) +B sin(@t)) (from Bloomfield [42, p. 21]). The graph has 
an obvious period of 24 days. The new @ is thus about 0.262. Thus one concludes 
that the original variable star data had two periodic components. 

It should be noted that the Shannon sampling theorem affects the choice of a. 
If the sampling interval is d = 1 day, then 0 < œ < 7 /d. Every frequency not in the 
range has an alias in the range. 

In Chap.5 of Bloomfield [42] filtering methods are used to show that x399 in 
the variable star data is in error, with x399 = 18 and not 19. This rather impressive 
deduction does not appear to be in the realm of possibility for other examples 
considered by Bloomfield; e.g., European wheat prices for 1500-1869. 
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Fig. 1.11 Periodogram of the variable-star data for frequencies @, 0.20 < œ < 0.24. (From 
Bloomfield [42, p. 19]. Reprinted by permission of John Wiley & Sons) 
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Fig. 1.12 Variable-star data with fitted sinusoid subtracted. (From Bloomfield [42, p. 21]). 
Reprinted by permission of John Wiley & Sons 
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1.3.5 Poisson’s Sum Formula as a Trace Formula 


Our next goal is to obtain an interpretation of Poisson’s summation formula in terms 
of traces of integral operators. In order to do this we shall briefly sketch the theory 
of integral operators on L?(D), where D is a measurable subset of R”. References 
for this subject include Courant and Hilbert [111], Dieudonné [134], Lang [387], 
Maurin [458], Stakgold [618,619] and Yosida [748]. 


Definition 1.3.3. Let K € L? (D x D). Then the integral operator Kx defined by 


Lefi) = f Ky) F)dy (1.10) 
D 


is called a Hilbert-Schmidt operator with kernel K. 


Actually, more general Hilbert-Schmidt operators are considered in the refer- 
ences. Clearly Lx maps functions f in L?(D) to functions Lx f in L?(D). And just 
as clearly, the map Lx is linear. So we call it an “operator,” which just means linear 
map. 


Definition 1.3.4. An operator L : H — H, on a Hilbert space H, is said to be 
self-adjoint if (Lf,g) = (f,Lg), for all f,g in H. Here (f,g) denotes the inner 
product in the Hilbert space H. 


Note that the Hilbert-Schmidt operator (1.10) is self-adjoint if and only if 
K(x,y) = K(y,x), for almost all x,y € D. (1.11) 


Definition 1.3.5. An operator L : H — H, on a separable Hilbert space H, is 
compact if for every bounded sequence xn in H, the sequence Lx, has a convergent 
subsequence. 


Theorem 1.3.6. A Hilbert-Schmidt operator on a compact domain is a compact 
operator. 


For a proof, see Yosida [748, pp. 227—278], or any of the references above, some 
of which consider only special cases. 


Exercise 1.3.12 (Examples of Kernels). 


(1) Let D = (0,1) and K(x,y) = |x—y|~4, 0 < a < 5. Show that this is the kernel 
of a Hilbert-Schmidt operator on L*(D). And show that, if instead 5 <a<l, 
then the operator is no longer Hilbert-Schmidt, although it is compact. 

(2) Show that the kernel exp(—x? — y?) generates a Hilbert-Schmidt operator on 
L?(R), while the kernel exp(—|x — y|) generates an operator that is not even 
compact although it is bounded on Z? (R). 


Hint. See Stakgold [619, pp. 353-354]. 
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Theorem 1.3.7 (The Spectral Theorem for Compact Self-Adjoint Operators). 
Suppose that the operator L: H — H is a compact and self-adjoint on a separable 
Hilbert space H. Then H has a complete orthonormal set of eigenvectors {vn} of 
L with Lvn = AnVvn, An € R, n = 1,2,3, .... Every vector v in H has a generalized 
Fourier series representation such that 


v= > (v,Vn)vn, 


n>1 


Lv = 5 An(V, Vn)Vn. 


n>1 


For a proof of the spectral theorem see the references above. The idea is that 
one can obtain the eigenvalues of L by finding maxima of the quadratic form (Lv, v) 
for v in H with ||v|| = 1. This method can actually be put on a computer, as the 
Rayleigh—Ritz and finite element methods. See Strang and Fix [641] for a more 
complete story of the finite element method. 


Exercise 1.3.13 (Expansion of the Kernel of a Self-Adjoint Hilbert-Schmidt 
Operator). Suppose that K(x,y) is the kernel of a self-adjoint Hilbert-Schmidt 
operator Lx given by (1.10). Let {v,} be a complete orthonormal set of eigenvectors 
of Lx as provided by Theorem 1.3.7, with Lx vn = ÀAnvn. Show that the series 


X, AnVn(x)vn(y) 


n>1 


converges to K(x,y) in the Z? norm. 


If the trace of the self-adjoint integral operator Lx in (1.10) has any meaning, it 
must be the infinite sum > A, of all the eigenvalues å, of Lx. Let us suppose that the 
operator Lx is positive; i.e., that (Lx f, f) is positive for all f Æ 0 in H. This means 
that all the eigenvalues of Lx are positive. Then we have the following theorem. 


Theorem 1.3.8 (Mercer’s Theorem). Suppose that Lx is a positive self-adjoint 
Hilbert-Schmidt operator (1.10) with a continuous kernel K on a compact set in R”. 
Then 


K(x,y) = E Avr) vn), 


where the vn and A, are as in Theorem 1.3.7, and the convergence of the series is 
absolute and uniform. 


This is proved in the references. 


Exercise 1.3.14 (The Trace of a Positive Self-Adjoint Hilbert-Schmidt 
Operator). Suppose that the Hilbert-Schmidt operator Lx satisfies the hypotheses 
of Theorem 1.3.8. Show that 


Trace Lk = ¥ n = l K (x,x)dx. 
D 


n>1 
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We want to apply this last exercise to deduce Poisson’s summation formula. 
Suppose that f : R” — C is a Schwartz function. Define an integral operator on 
functions f in LZ? (R” /Z") by 


Lyg(x) = (f * 8)(x j= [ fe- yg (1.12) 


Then the functions en(x) = exp(2ai'nx), n € Z”, give a complete orthonormal set 
of eigenfunctions of Ly on L? (R” /Z”). To see this, note that 


Lyenls) = (fxe = |. fle—y)exp(2ni'ny)dy 
= f _f(v)exp(2ni'n(x—v))dv = f(nen(2). (1.13) 
On the other hand, we have 
Lies) = f gp KEIO) 
with 


Ky(x,y)= $, f(x-y—n). (1.14) 


nezm 


Now we can use Exercise 1.3.14 to say that 


Trace L= $, f(n) = f ope E (1.15) 


neg™ 


But this last integral is easily seen to be | f(n), by formula (1.14 ). Thus (1.15) 
is really Poisson’s summation formula of Theorem 1.3.2. This discussion comes 
from a seminar talk of Larry Verner in 1973, though it was certainly well known 
to Selberg as early as 1951. In Chap. 3, we will begin consideration of Selberg’s 
trace formula with an analogous analysis. Our main interest then will be in integral 
operators on noncompact domains, however. 


Exercise 1.3.15 (Band- and Time-Limited Functions—Another Look at 
Uncertainty). Slepian and Pollak [605] note that Exercise 1.2.16 of Sect. 1.2 
does not really tell you “just how close one can come to simultaneously limiting in 
both time and frequency.” Consider f in L? (R). Set F (w) = f(w/2z). If F (w) = 0 
for |w| > B, we shall say that f is band-limited, and write f € Bg. Show that the 
function f in 6g with || f||2 = 1 and 


+T 
/ | f(t)|?dt = maximum 
-T 
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Table 1.2 Values of A,(c) = Ln(c) x 107?” (from Slepian and Pollack [605, p. 4].) 


c=0.5 c=1.0 c=2.0 c=4.0 c=8.0 
n L p L p L p L P L p 
(0) 3.0969 1 5.7258 1 8.8056 1 9.9589 1 1.0000 0 
1 8.5811 3 6.2791 2 3.5564 1 9.1211 1 9.9988 1 
2 3.9175 5 1.2375 3 3.5868 2 5.1905 1 9.9700 1 
3 7.2114 8 9.2010 6 1.1522 3 1.1021 1 9.6055 1 
4 7.2714 11 3.7179 8 1.8882 5 8.8279 3 7.4790 1 
5 4.6378 14 9.4914 11 1.9359 T 3.8129 4 3.2028 1 
6 2.0413 17 1.6716 13 1.3661 9 1.0951 5 6.0784 2 
7 6.5766 21 2.1544 16 7.0489 12 2.2786 7 6.1263 3 
8 1.6183 24 2.1207 19 2.7768 14 3.6066 9 4.1825 4 


is the eigenfunction corresponding to the largest eigenvalue of the integral operator 


ae sin[B(t — s)| 
LA) = [Feds 
Hint. Consider the operators 
1 (t8, 
Bf(t) = zl, f(w/2r)exp(itw)dw, 
and 
_Jf®©, if ST, 
= h if l| >T. 


Show that BT f = Lf. 


Note. Slepian and Pollak calculate the eigenvalues of L by showing that the 
corresponding eigenfunctions must be spheroidal wave functions. These results have 
been generalized to some symmetric spaces by Grünbaum et al. [234]. Slepian and 
Pollak give a table of values of A,(c) (Table 1.2) where c = BT /2. 


Some of the most interesting applications of the spectral theorem for integral 
operators are in partial differential equations. This occurs because one can often 
write the inverse operator of a differential operator D (or of D — ul, I =identity, u € 
C) as an integral operator. The kernel of this integral operator is called the Green’s 
function. For example, the Green’s function G(x,y) for —A on a region D in R? 
with the (Dirichlet) boundary condition that f vanish on the boundary dD satisfies 


(THE) = Ip Guy) FO) ay, 

—ATf =f, forall f in L*(D), 

—TAf=f and f such that both f and Af are in Z? (D) 
and such that f vanishes on the boundary dD. 


(1.16) 
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Equations (1.16) say that 


—A,G(x,y) = 6(x—y), for x,y in D 
Dae (1.17) 
G(x,y) =0, ifxis in OD. 


Thus Exercise 1.1.6 of Sect. 1.1 implies that in 3 dimensions 
-1 
G(x,y) = (4r |x= yl) +h(x,y), 


where (x,y) has no singularities and is chosen to make G(x,y) vanish if x lies on 
the boundary of D. Finding G(x,y) explicitly is difficult or impossible unless the 
region D is nice. For example, sharp spikes are not allowed in the region D. The 
following exercise gives a simple example of a Green’s function for the Laplace 
operator on a ball of radius r. You can find more examples and discussions of 
Green’s functions in Sect. 2.2 and in the following references: Courant and Hilbert 
[111], Dunford and Schwartz [145, Vol. II, Chap. XII] (where Green’s functions 
are called resolvent kernels), Garabedian [197], Morse and Feshbach [479, Vol. 
II, Chap. 10], and Stakgold [618,619]. We will have more to say about Green’s 
functions in Chaps. 2 and 3. 


Exercise 1.3.16 (Green’s Function for —A on a Ball of Radius r by the Method 
of Images). William Thompson developed this method and applied it to problems 
in electro- and magneto-statics. The idea is to use the map of R? which sends a 
point x € R? to its inverse point x* with respect to the sphere of radius r. By this, 
we mean that x* lies on the same radial line from the origin as x and ||x||||x*|| = 1. 
The problem is to show that the Green’s function for the ball of radius r in R? is 


G(x,y) = (4a) (x= yl rly ley" 1). 


Hint. To show that G(x,y) = 0 when ||x|| = r, you must use the fact that 


_— +* 
Bove jar 
llx—yll lll 


If the Green’s function G(x,y) exists for the region D, then the operator T in 
formula (1.16) will be a Hilbert-Schmidt operator which means that both T and —A 
will have a complete orthonormal set of eigenfunctions v, in L?(D) with —Av, = 
AnVn and 


Gay) = >A, Valen). (1.18) 


n=] 


In general, the convergence of this series is only in the L* norm. This gives another 
method for finding the Green’s function. However, even in the simple example 
below, one finds a series which does not converge absolutely. 
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Exercise 1.3.17 (Green’s Function for —Aon a Rectangular Solid). Consider the 
Dirichlet eigenvalue problem 


ee O<x <a,0<y<b,0<z<c 


v=0 on the boundary of the rectangular solid. 


Find a complete orthonormal set of eigenfunctions and consider formula (1.18) 
for the Green’s function. Show that the series does not converge absolutely, using 
information about Epstein’s zeta function (Theorem 1.4.1 of Sect. 1.4). You can find 
a discussion of the method of images for this problem in Courant and Hilbert [111, 
pp. 378-384]. 


1.3.6 Schrödinger Eigenvalues 


As a last topic in this discussion of spectral theory, we want to consider a 
Schrédinger eigenvalue problem in one dimension: 
y(x) E L(R). l 


This is a good, though simplistic, model of a quantum-mechanical problem (see 
Messiah [465, Vol. I, Chap. XII]), which indicates quite clearly that a continuous 
problem can sometimes have a only a discrete sequence of possible eigenvalues. 
In quantum theory this is thought of as saying that the system is quantized and can 
only be in a certain discrete list of states. The eigenvalues of the differential operator 
in (1.19) correspond to the energy levels of the system, once everything has been 
normalized in the proper way for physics. We have chosen our normalization to 
connect with the theory of Fourier integrals. If the problem were on a finite interval, 
then we could find a Green’s function and use Hilbert—Schmidt theory to see that the 
spectrum is discrete. However, this problem is singular, because it is on an infinite 
interval. Thus the spectrum has no a priori reason to be discrete. And certainly 
there are other Schrödinger eigenvalue problems with mixtures of continuous and 
discrete spectra. An example is the eigenvalue problem associated with the hydrogen 
atom which is considered in Sect.2.1. Another such eigenvalue problem will be 
discussed in Chap. 3. 

We could approach the eigenvalue problem (1.19) by substituting y(x) = 
w(x) exp(—zx?). The resulting differential equation for w(x) can be seen to have a 
polynomial solution only for special values of A. However, we shall take a different 
approach—the factorization method of Infeld and Hull [316] (see Talman [655]). 
Set D = d/dx. Then we can factor the operator 


H = D? — (2nx)* = (D—2ax)(D+2nx) — 27 = (D+2nx)(D—2nx) +27. 
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Define the raising operator A = D — 27x and the lowering operator B = D + 27x. 
Then H = AB — 27 = BA + 27. 


Exercise 1.3.18 (Hermite Functions Are Eigenfunctions of H and the Fourier 
Transform). 


(1) Show that if we define yọ = exp(—7x*), Yn = Ayn—1, then Hyn = Anyn, where 
the eigenvalues are A, = —(4n+2)z. 

(2) Show that if f = the Fourier transform of f, then f, = (—i)"yn. 

(3) Show that y,(x) = (D — 27x)” exp(— nx?) = exp(—mx?)D" exp(—27x’). 

(4) Show that y,(x) is the product of exp(—zx7) and a polynomial of degree n. 

Exercise 1.3.19 (Spectral Decomposition of the Schrédinger Operator H). Set 

Va = llyally Yn with y, as in Exercise 1.3.18. Show that v, gives a complete 


orthonormal set for Z? (R) and thus the spectral decomposition of the differential 
operator H = D? — (27x)?. 


Hint. The functions vn are pairwise orthogonal since they are eigenfunctions of the 
self-adjoint operator H corresponding to distinct eigenvalues. To see that they form 
a complete orthonormal set, you must show that if f € L?(IR) and (f vn) = 0 for all 
n, then f is zero (almost everywhere). One way to do this is to look at 


(Foyexp(—ay*)) (x) = f £0) exp(—2mixy— my" )dy 


= y AA Syfel): 


n>0 


If (f,¥n) = 0 for all n, this must vanish. Why? 


Exercise 1.3.20 (Connection with the Classical Hermite Polynomials). Define 
H,,(x) = (—1)" exp(x”)D" exp(—x), where D = d/dx. 


The H,,(x) are the classical Hermite polynomials (see Courant and Hilbert [111, 
pp. 91-97]). Find the formula connecting y, and Hy. 


Exercise 1.3.21 (A New Expression for the Fourier Transform on 17(R)). This 
idea comes from Wiener [736, pp. 51-71]. Suppose that the functions v, are as in 
Exercise 1.3.19. Then any function f in L*(IR) has an expression f = ¥(f,Vn)Vn- 
What is the corresponding expansion for the Fourier transform of f ? 


Exercise 1.3.22. Find an example of f € L! (R) such that f € L! (R), and f(n) =0 
forall ne Z; f(0)=1, f(n) =0, ifn €Z, n £0. This shows that the Poisson 
summation formula is not always valid when you might expect it to be. 


Hint. See Katznelson [347, Exercise 15, pp. 130-131]. 
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Exercise 1.3.23 (Periodic Green’s Function for the Helmholtz Equation in R3 
by the Method of Images). 


(1) Show that g(x,y) = exp(ik||x — y||)/ (47||x— ||) is a Green’s function for the 
Helmholtz operator (A — k?) on R°. Other choices for g(x,y) are 


exp(—ik||x — yl|)/ 4rl||x— yll), cos(kllx— yll)/ 4rllx— yll), 


etc. Our choice of g(x,y) dies off exponentially at infinity, if Imk > 0. It can 
be viewed as corresponding to outgoing waves in the wave equation. 
(2) Use the method of images to obtain the Green’s function for (A— ke) on the 
unit cube R? / Z corresponding to periodic boundary conditions. 
Answer. 


1 y exp(ik ||x +n — y]]) 


G = 
K(X, y) 4r x+n- yli 


neZ? 


For what values of k does this series converge? 
(3) Show that, as in formula (1.18), we can represent the Green’s function in part 
(2) by the following series converging in the L?-sense: 


Gx (x,y) = £ 


neZ3 


exp(27i ‘n(x — y)) 
||27n||2 — k2 


For absolute convergence, show that one can look at G — Gm, since if Ry = 
(A—22) 71, then Ry — Rm = (K? — m?) Rin. 
(4) What does Poisson summation do to the formulas in parts (2) and (3)? 


1.4 Mellin Transforms, Epstein and Dedekind Zeta 
Functions 


It is remarkable that the deepest ideas of number theory reveal a far-reaching resemblance 
to the ideas of modern theoretical physics. Like quantum mechanics, the theory of numbers 
furnishes completely nonobvious patterns of relationship between the continuous and 
the discrete (the technique of Dirichlet series and trigonometric sums, p-adic numbers, 
nonarchimedean analysis) and emphasizes the role of hidden symmetries (class field theory, 
which describes the relationship between prime numbers and the Galois groups of algebraic 
number fields). One would like to hope that this resemblance is no accident, and that we are 
already hearing new words about the World in which we live, but we do not yet understand 
their meaning. 

—From Yu. I. Manin, as translated by Ann and Neal Koblitz, Mathematics and Physics, 
Birkhauser, Boston, 1981. 


1.4.1 Mellin Transforms 


In this section we shall consider a method used both by number theorists and 
physicists to obtain analytic continuations of Dirichlet series. Riemann used this 
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procedure in [542, pp. 145ff], to obtain the analytic continuation of the Riemann 
zeta function: 


(i= n", Kero (1.20) 


to the entire complex s-plane. Ewald used the same method to compute potentials 
of crystal lattices (see Born and Huang [57, p. 389]). Riemann’s method can also 
be used to compute Green’s functions for a rectangular parallelepiped (see Courant 
and Hilbert [111, pp. 378-384], and Exercise 1.3.23 of Sect. 1.3 ). We shall discuss 
these examples in this section, as well as the Dedekind zeta function of an algebraic 
number field. 

Before considering Riemann’s method we must understand another important 
integral transform. 

The Mellin transform of f : Rt — C is 


Mf(s) = [ “f(y tay. 


The Mellin transform is as well suited to the study of the multiplicative properties 
of numbers as the Fourier transform is well suited to the study of the additive 
properties of numbers. In fact, the change of variables y = e* allows you to go 
from one transform to the other. You could also consider the Mellin transform as a 
two-sided Laplace transform. 


Exercise 1.4.1 (Inversion of the Mellin Transform). Make the change of 
variables y = e* to see that Mf(s) = F(t/27), where s = o + it, with o,f real, and 
F(x) = f(e*)e®. Suppose that F(x) € L! (R) and that f(y) is piecewise continuous 
with one-sided derivatives always existing. Use Exercise 1.2.7 of Sect. 1.2 to show 
that 


eti 


z (FO+) +0) = se tim [ya p(o)as. 


Exercise 1.4.2 (Properties of the Mellin Transform). Prove the following prop- 
erties, assuming that the functions that are Mellin transformed satisfy the conditions 
of Exercise 1.4.1, for example. 


(a) Set D = yd/dy. Then MDf(s) = —sMf(s). 
(b) {Mf(s) = M(f(y) logy). 


(c) M(f3)(s) = zii Jre =c Mf (z)Ma(s — z)dz. 
(d) For f,g : R + C, define (f * g)(y) = Jy’ f(v/u)g(u)u du. Then 


M(F *8)(s) =Mf(s)Mg(s). 


(e) Set fa(y) = f (ay), for a and y in R*. Then M(fa)(s) =a *M f(s). 
(f) Jre s=cMF(s)a(s)ds = So FOM! (yy dy. 
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Table 1.3 A short table of Mellin transforms 


f(y), y>0 Mf(s) 
exp(—ay), y>0 a“*T(s), Res>0 
ico y>a T(s,a) 

0, O<y<a 
exp [3 (v+4)],a>0 2K; (a) 
exp(—zy)(I+y)*!, Rez>0 T(s)'¥(s,a+s,z), Res >0 


Exercise 1.4.3. Verify Table 1.3. 


Hint. Here you need to know a little about special functions such as gamma, 
incomplete gamma, K-Bessel, and confluent hypergeometric functions. We will have 
much more to say about these functions as we continue our discussion. You can 
view gamma and incomplete gamma as being defined by the first two lines of this 
table. See Theorem 1.4.1 for an application of T (s,a). The K-Bessel function K;(a) 
is discussed in Sect. 3.2. Confluent hypergeometric functions will appear in Volume 
II [667]. See, for example, Lebedev [401, Chaps. 1, 5, 9]. A longer table of Mellin 
transforms can be found in Erdélyi et al. [165] or Oberhettinger [498]. 


More information on Mellin transforms can be found in Sneddon [609] and 
Titchmarsh [678]. You could, of course, also consider Mellin transforms of 
distributions. 

It is possible to view the Mellin inversion formula in Exercise 1.4.1 as the 
spectral resolution of the singular differential operator (yd/dy)* acting on 
functions in the space 


L?(R*,y7!dy) = 7 :R*—C measurable 


f IFO Py dy < =}. 


The functions y* are all eigenfunctions of this differential operator. See Stakgold 
[618, pp. 465-466], for an exercise on the Mellin inversion formula from this point 
of view. In Chap. 3 and Volume II we will generalize the Mellin inversion formula 
to spaces of positive n x n matrices. This is, in fact, one of our main objectives 
in writing these volumes. Note that the operator (yd/dy)* and the measure y~!dy 
are invariant under the change of variables w = ay for a € R*. Thus we are really 
viewing a spectral decomposition that is intimately related to the multiplicative 
group of positive real numbers when we study the Mellin inversion formula. The 
generalizations of Mellin inversion to be considered in Chap.3 and Volume II are 
directly related to the general linear group of n x n nonsingular real matrices. 
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All of the analytic continuations of zeta functions that we want to consider are 
related to the analytic continuation of Epstein’s zeta function. The latter lives on 
the symmetric space P, of positive definite symmetric n x n real matrices and it will 
appear in the study of harmonic analysis on P, in Chap. 3 and Volume II [667]. 


Definition 1.4.1. Epstein’s zeta function of Y € P, and s € C, with Res > n/2 is 


zY) = 5 X Yf”. 
0 


acZ"— 


Here Y [a] = ‘aYa, thinking of a as a column vector, with ‘a =transpose of a. Thus 
Y [a] is the quadratic form 


n 
Y [a] = 5x yijaiaj, if Y = (yij), a= '(a1,...,an). 
ij=1 


In the special case that n = 1, then Y = y € R*, and Z(Y, s) = y *C€(2s), where 
¢=Riemann’s zeta function. See Edwards [149] for a fascinating treatment of the 
work on that zeta function, whose main importance for number theory comes from 
the following formula: 


Cis)= T] G-p), Res>1. (1.21) 


p=prime 


Exercise 1.4.4. Prove formula (1.21) which is called the Euler product for 
Riemann’s zeta function. You need to know that every natural number n € Z* can 
be factored uniquely (up to order) as a product of primes. 


Many other special cases of Epstein’s zeta function arose in number theory 
before Epstein’s papers which appeared in 1903 and 1907 (see Epstein [163]). 
Most of these cases arise from the fact that the Dedekind zeta function of an 
algebraic number field can be written as a finite sum of integrals of Epstein zeta 
functions (Theorem 1.4.2). The sad story of the end of Epstein’s life and the horrors 
of Hitler’s Germany can be found in Siegel [600, Vol. III, pp. 464-470], or the 
translation [601]. 

There are many applications of Epstein zeta functions in statistical and solid-state 
physics. We will discuss some of these at the end of this section. An interesting 
reference that gives applications of Epstein’s zeta function in quantum-statistical 
mechanics is Hurt and Hermann [310, Chap. 8]. Many applications in physics are 
to be found in Kirsten and Williams [354]. 


Exercise 1.4.5 (Convergence of Epstein’s Zeta Function). 


(a) Use Theorem 1.3.5 of Sect.1.3 to show that Epstein’s zeta function does 
indeed converge absolutely and uniformly on compact subsets of the half-plane 
Res > n/2. 
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(b) Develop an integral test in several variables to do the convergence proof in 
another way. We will return to this topic in Volume IT [667]. 


Our next goal is to analytically continue Epstein’s zeta function to the whole s- 
plane as a meromorphic function of s, having its unique pole at s = n/2. The method 
goes back to Riemann ([542, pp. 147 ff]) and this paper is translated and discussed 
in Edwards [149]. Riemann did not mention incomplete gamma functions, but he 
did, in fact, obtain this expansion when n = 1. 


Theorem 1.4.1 (The Analytic Continuation of Epstein’s Zeta Function). Let 
T(s,x) be the incomplete gamma function as in Exercise 1.4.3 and set 


G(s,x) =x “*T(s,x) =i y! exp(—xy)dy. 


Then Epstein’s zeta function can be analytically continued to all s € C with its 
only pole a simple one at s = n/2 having residue 4a"? YPT (n/2)!. Here |Y| 
denotes the determinant of Y. 


The analytic continuation comes from the incomplete gamma expansion: 


A(Y,s) = wT (s)Z(Y,s) 


= Ll S (afsala) +126 (2 -s,aY-'ta])). 


2s—n 2s 2 cE -0 


Thus Epstein’s zeta function satisfies the functional equation 
A(Y,s) = |Y 12A (Y7! n/2—s). 
Furthermore, Epstein’s zeta function takes on the values 


1 
2? 


Z(Y,0) = Z(Y, -k) =0, k=1,2,3,.... 


Proof. This demonstration will be broken up into two exercises. 


Exercise 1.4.6 (The Transformation Formula of a Theta Function). For Y in 
Pn, t € RF, define 


0(Y,t)= > exp(—zY[a)r). 


acz” 


Show that the series converges and that 


O(Y,t) = |Y 12r "/20(Y7!,t!) with |Y|= determinant of Y. 
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Hint. Use the same method as in Exercise 1.3.8 of Sect. 1.3. To do this you must 
write tY = B?, with B € Py. Then set f(x) = g(Bx). The Fourier transforms of f and 
g are related by 


F(x) = [BI 1g"). 


Note. The series for theta converges quickly for large t. When ¢ is near 0, the 
transformation formula allows one to replace ¢ by t~! and obtain a quickly 
convergent series again. We used this fact already in Theorem 1.3.5 of Sect. 1.3. 
It will be used over and over again in these notes. 


Exercise 1.4.7 (Riemann’s Trick). 


(a) Show that for Res > n/2: 


A(Y,s) = af OYA) Dar 


You can justify the interchange of summation and integration by Fubini’s 
theorem (see Lang [387, p. 295]). 
(b) Obtain the analytic continuation of Epstein’s zeta function by writing 


oo 1 oo 
Go 
0 0 1 
Then replace ¢ by 1/t in the first integral and use Exercise 1.4.6. This should 
lead you to the formula 


A(Y,s) = | w(¥,t)dt-+|¥|-2 f "2-8! pdt 


a l 
+ zS C 
1 


where w(Y,t) = 5(O(¥ ,t) — 1). Then interchange sum and integral in the first 
two integrals and evaluate the last integral. 

(c) Show that the incomplete gamma expansion of Z(Y,s) converges exponentially 
faster than the original Dirichlet series defining Z(Y, s). 


Hint. The incomplete gamma function has the asymptotic expansion 


sige (1 | s—1 (s— 1)(s—2) 


t z t), as x —> œ. 


G(s,x) ~x 
x x 


The proof of Theorem 1.4.1 is complete once you have done the preceding 
exercises. Much use has been made of such incomplete gamma expansions in 
number theory (see Montgomery and Vaughan [475] and Lavrik [398]). Our next 
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goal is to apply the result to study the Riemann zeta function in the interval (0, 1). 
We do this in the next set of exercises, which go back to Riemann again (see 
Edwards [149, pp. 16—18]). Selberg generalized these exercises to the symmetric 
space P, in order to obtain the analytic continuation of generalizations of Epstein’s 
zeta function known as Eisenstein series. We will consider this in Volume II [667]. 


Exercise 1.4.8 (Multiplication Invariant Differential Operators on the Positive 
Reals). 


(a) Let L be a differential operator acting on functions f : Rt —> C. Suppose u : 
R* — R* is differentiable with a differentiable inverse. Define L” by L" f = 
L(fou)ou™!. Let a € R and Da = y“(d/dy)y'~*. Then define c(y) = cy for 
c € R”. Show that DS = Da; i.e., Da is multiplication invariant. 

(b) Suppose that L is a multiplication invariant differential operator on Rt. Define 
its formal adjoint L* by 


f (LAO) ‘dy = l SOL Oy dy. 
Rt Rt 


Note. dy/y is the multiplication invariant or Haar measure on the group of 
positive real numbers under multiplication. Show that if u(y) = 1/y, then LY = L*. 


(c) Use integration by parts to prove that D% = —Do_,. 


Exercise 1.4.9 (Transformation Formula of a Differentiated Theta Function). 
Set L = Dı /2Dj, using the notation of Exercise 1.4.8. Show that if 0 (y,t) is the theta 
function of Exercise 1.4.6, when n = 1, and t, y € R™, then 


L,0(y,t) =y720-2L,A(y 17). 


Exercise 1.4.10 (Power Series for ¢ (2s)around s = D. If L = Dj /2Dj, then L* = 
D,D3/2 and L*t* = s(s— Fes. Then replace 0 by L,0 in the proof of Theorem 1.4.1 


to show that 
2s(s— 5)xT(s)6 (2s) 
= AYpsi n fe, (r+) (2ntn? — 3) exp(—men?)at. 
Then show that we can expand the right-hand side as a power series in even powers 
of s— } with positive coefficients. Deduce that ¢ (s) < 0 for 0 < s < 1. 


The Riemann hypothesis says that ¢ (s) has its only zeros at 
s = —2,—4,—6,... (the trivial zeros) 
and at points on the line Res = 7 a line that is fixed by the functional equation 


of the zeta function. The hypothesis had been checked by computer for the first 
many million zeros by 1979 (see Brent [65] and Edwards [149]). By 2004 X. 
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Gourdon and P. Demichel had checked the Riemann hypothesis up to the 10!°th 
zero (see Ed Pegg, Jr.’s article called “Ten Trillion Zeta Zeros” at the website 
www.maa.org/editorial/mathgames). You win a million dollars if you have a proof 
of the Riemann hypothesis. Don’t send the proof to me, please. See the Clay 
Math. Institute website (www.claymath.org). However, the proof or disproof of this 
hypothesis has so far skillfully eluded its many pursuers. In Volume II [667] we will 
discuss the evidence for the Riemann hypothesis coming from a study of spacings 
of the zeros (see also the paper on Odlyzko’s website (www.dtc.umn.edu/~odlyzko/ 
doc/zeta.htm). 

Epstein’s zeta function behaves very differently from Riemann’s in some ways. 
For example, there are many matrices Y € P such that Z(Y,s) vanishes for some 
s with Res > 1 (see Davenport and Heilbronn [121]). And the following Exercise 
shows that the behavior of Z(Y,s) for s in the interval (0,1) is very different from 
that of Riemann’s zeta function when Y has a small minimum 


my = min {Y [a] =‘aYa|aeZ*-0}. 


Exercise 1.4.11. Use the incomplete gamma expansion of Z(Y,s), Y € Po, in 
Theorem 1.4.1 to graph the function A(Y,s) for x in (0, 1) and 


t 0 
Y= = 1,.1,.01,.001. 
Gat ,-1,.01, .00 


Hint. You can compute the incomplete gamma functions using Mathematica or 
Matlab or whatever is your favorite mathematical software. Compare with Fig. 1.13. 


Historical Note. 

Back in the day (more specifically, the 1970s and 1980s) we had to write our 
own programs to compute incomplete gamma functions and K-Bessel functions and 
the various special functions that came up in our work. My late ex-husband Riho 
Terras wrote the following ALGOL procedure to compute the incomplete gamma 
functions (see [673,674]). 


1. ALGOL PROCEDURE TO COMPUTE INCOMPLETE GAMMA FUNCTION 
G(S,X) 


100 REAL PROCEDURE G(S,X) ; 

200 COMMENT THIS FINDS INCOMPLETE GAMMA G(S,X) FOR 0<S<1 
AND X>0; 

300 VALUE S,X; REAL S,X; 

400 IF X GEQ 0.5 THEN BEGIN REAL W; INTEGER K; 

500 FOR K:= ENTIER (8+50/X) STEP -1 UNTIL 1 DO 

600 W:=X+W*(K-S) / (W+K) ; 

700 G:= EXP (-X) /W; 

800 END ELSE BEGIN 

900 FOR K:=ENTIER(17+X+X) STEP -1 UNTIL 0 DO 
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1000 W:=(1+X*W) / (S+K) ; 

1100 G:=X** (-S) *GAMMA(S)- EXP (-X) *W; 
1200 END; 

1300 END OF PROCEDURE G; 


The procedure G is based on two formulas. When 0 < s < 1 and x > 0.5, it uses 
the continued fraction for G(s,x): 


7 (2 l1—s 1 2=s 2 


a for Res>O. (1.22) 
x+ 1+ x+ 14 x+ 


This is due to Legendre. See Exercise 1.4.12 for hints on a derivation. You can find 
a discussion of continued fractions in Henrici [289, Vol. II, Chap. 12, especially 
p. 629]. The ALGOL procedure quoted above views the continued fraction (1.22) 
as a composition of fractional linear transformations: 


T,.(w) =x +w(k—s)/(w+k). 
That is, the continued fraction should be viewed as 
Ti- T(w) = (e*G(s,x))", (1.23) 


for the correct choice of w. In the ALGOL procedure, we replace w by zero. This is 
analogous to using the first k terms of a series. The recursions sending w to 7;,(w) are 
error-correcting when 0 < s < 1 and x > 0. The value of k used to compute G(s,x) 
by truncating the continued fraction via (1.23) is k = [8 +50/x], where [x] is the 
greatest integer < x, also known as the floor of x or |x|. Clearly this is very large if 
x is very near 0. In fact, the continued fraction is also bad when s > 1, because then 
the T, start to magnify rather than correct errors. See R. Terras [673,674] and Henrici 
[289] for discussions of error-correcting recursions. 

When 0 < s < 1 and 0 < x < 0.5, the ALGOL procedure above uses a power 
series: 


G(s,x) =x “T(s,x) =x “*T(s) —x *y(s,x), 


x 
when y(s,x) -| fe diese £ 
0 


n>0 (S)n-41 ; 


x 


(1.24) 


(s)o = 1, and (s), =T'(s+n)/T(s) =s(s+1)---(s+n—1). 
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Fig. 1.13 Graphs of the product of the Epstein zeta function with its gamma factors in the interval 
(0,1) from Purdy et al. [523]. Note that the x-axis does not cross the y-axis at y = 0. The figures 
show that the Epstein zeta functions graphed here vanish twice in the interval (0,1). Thus the 
Epstein zeta function does not behave like Riemann’s zeta function in the interval (0, 1) 


Exercise 1.4.12 (Properties of the Incomplete Gamma Function). 


(a) Show that if p(s,x) = e*G(s,x), then (s+ 1,x) = 1 +sọ(s,x)/x. 

(b) Prove formula (1.22) above. This can be done using part (a) or via recursions 
for the confluent hypergeometric function. 

(c) Prove formula (1.24) above. 

(d) Use part (a) to prove the asymptotic expansion for G(s,x) in the hint for 
Exercise 1.4.7. 


Your answer to Exercise 1.4.11 should resemble the graphs in Fig. 1.13, which 
were produced in the 1970s by a Burroughs 6700 computer at the University of 
California (San Diego) using the ALGOL program for incomplete gamma functions 
above. See Purdy et al. [523]. 


Exercise 1.4.13 (Epstein Zeta Functions for Positive Definite Matrices with 
Small Minima over the Integer Lattice). Let 0 < u < 1. Suppose Y is in P, with 
detY = |Y| = 1. Let 


my = min {Y [a] = aYa | a € Z” — 0}. 
Suppose that my < nu/2me or my-ı < n(1 —u)/2me. Show that if n is suffi- 


ciently large (dependent on u), Z(Y,nu/2) > 0. Conclude that Z(Y,s) vanishes in 
(nu/2,n/2). 
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Hint. Use the fact that G(s,a) > a~*T(s) — 1/s. Or see A. Terras [664]. 


The size of my in Exercise 1.4.12 is connected with part of Hilbert’s 18th problem 
(see Milnor [468]) and we shall discuss it in more detail in Volume II [667]. It should 
be noted here that there are forms Y in P, with |Y| = 1 and my as small as you like 
if n > 1. Moreover, there are forms with my > n/(27¢e), if n is large, as we shall 
prove in Volume II. In fact, we shall show that given s in (0, n/2), there exist Y in 
Pn with Z(Y,s) positive, negative, or zero. Related work is to be found in the paper 
of Sarnak and Strémbergsson [561]. 

Next we would like to consider some applications of Theorem 1.4.1 in algebraic 
number theory and crystallography. We shall discuss algebraic number theory first. 


1.4.3 Algebraic Number Theory 


Here we give a brief survey of the essentials of algebraic number theory. More 
details can be found in Hecke [260], Lang [388], Manin and Panchishkin [452], 
Narkiewicz [491], Samuel [554], Stark [623] and Stein [637] for example. There 
is a list of links to online number theory notes and teaching materials at 


http://www.numbertheory.org/ntw/lecture notes.html. 


There are many computer programs for number theory such as PARI and SAGE. 
Mathematica will do some algebraic number theory computations. See H. Cohen 
[95] for more information on computational algebraic number theory. 

Algebraic number theory originated with the work of Kummer, Dedekind, and 
others in the 1800s. They wanted to prove Fermat’s conjecture that for n = 3,4,5,..., 
the equation x” + y” = z” has no integer solutions x,y,z with xyz Æ 0. It was found 
that if one knows that the ring of cyclotomic integers 0, = Zlexp(27i/n)] has 
unique factorization into primes, then Fermat’s conjecture follows. Unfortunately 
Dn is only a unique factorization domain (UFD) for 29 of these rings (see Masley 
and Montgomery [455]). The Fermat conjecture was proved in 1995 by A. Wiles 
with the help of R. Taylor (see Wiles [740] as well as Taylor and Wiles [661]). 
We will have a little more to say about this in Chap.3. See also Diamond and 
Shurman [133]. A popular introduction is to be found in Mozzochi’s book [480]. 

Another question, which might appear initially to have nothing to do with 
cyclotomic integers, is deciding whether an ordinary integer is prime. This has 
become interesting to the CIA, NSA and all of us who carry out our business online 
because the first form of public key cryptography involves two huge primes, p and 
q, plus Fermat’s little theorem (see Simmons [602] or Terras [668]). This method 
is called RSA cryptography, and RSA is now a company. There are attacks on RSA 
cryptosystems. The primes p and g must be very large The RSA website (www. 
rsa.com) has a discussion. The suggested size of the modulus n = pq is 1024 bits 
(meaning that you must express the number pq base 2 and see something like 
21025 _ 1 zeros and ones). Thus the number of decimal digits is something like 308. 
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You would not like to see that number written down here. The RSA website says 
such a key is OK for corporate uses but not for extremely valuable keys. 

Algorithms for distinguishing prime numbers from composite numbers are 
discussed in H. Cohen [95]. One such algorithm involves algebraic number theory 
in cyclotomic fields (see Adleman, Pomerance, and Rumely [2]). Algorithms for 
factoring are also discussed by Cohen. 

Thus we are led to the study of algebraic number fields K = Q(@), œ 
algebraic; i.e., @ is a solution of an equation 


f(@) = @" + dn_1@" | +---+ay9 =0, a € Q. 


Assuming that the polynomial f(x) is irreducible, we say that n is the degree of K 
over Q. We shall write 


n=|K:Q]. 
Let œt) € C, j= 1,2, ..., n, denote the roots of the irreducible polynomial 
f(x) with @ € R, for j= 1,2, ..., rı and oY g R, for j= ri +1, ..., n. 


Furthermore, we shall order the roots so that the non-real roots come in pairs of 
complex conjugates as follows: 


o) = oUt»), j=ri+1, .., r +r 


We always use an overbar to denote complex conjugate (and sometimes equivalence 
class or coset in a group). The map from @ to œ} ) extends to a field isomorphism 
x — x) of K into C, fixing points of Q. We shall call these isomorphisms the 
conjugations of K. The images of K in C are called conjugate fields to K. A totally 
real number field is such that all conjugate fields are subfields of R, i.e., r2 = 0 and 
ryan. 

The trace of a € K is defined to be 


Trx/Q@ = Tra = y at), 
i=l 
The norm of «œ € K is defined to be 
Nx/ga@ = Na = JJa. 
i=l 
One has the properties: 
Trx/g@, Nxjg%€Q, Tr(a+B)=Tr(@)+Tr(B), N(@B)=NaNB. (1.25) 
Problems involving the ordinary ring of integers Z in the rational number field Q 


lead directly to the study of the ring Ox of algebraic integers in the number field 
K; i.e., solutions of equations 
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a" +an_10” | +---4+a9=0, with aj in Z, alli. 


Number fields generalize ordinary arithmetic since Og = Z. And the arithmetic of 
these generalized integers is quite fascinating. It turns out that, just as in physics, 
analysis is needed to study these seemingly purely algebraic objects. A direct way 
to obtain algebraic information is to study the Dedekind zeta function of K and other 
related L-functions. 

Since Ox is not, in general, a UFD, one has to replace integers by ideals. An ideal 
A in Ox is an abelian group under addition such that 0x2 C A and A + {0}. 
A principal ideal has the form a@Dx = (a), for some & in Ox. The product of two 
ideals 2 and 8 is the ideal defined by 


k 
AB = i X aibi 


i=1 


aeann}, 


The sum of two ideals 2( and %5 is the ideal 
A+B ={a+b| acA, be B}. 


The arithmetic of ideals works very much like the arithmetic of ordinary integers. 
One can show that every ideal can be factored uniquely (up to order) into a product 
of prime ideals. 

Using elementary divisor theory (i.e., Gaussian elimination over Z, which is the 
theory lying behind the fundamental theorem of abelian groups), one can show that 
every ideal % has a Z-basis or integral basis; i.e., 


A = Zai PZy8-:--PBZM%, for some @1,02,..., On E A. 


Thus we can view 2 as an analogue of a crystal lattice. Moreover, one finds that the 
quotient Dx /2 is a finite ring. We will say that the norm of the ideal 21 is 


Norm X = NA = #(Ox«/A). 


One can show that for a principal ideal (œ), N(œ) = |Na|, where the norm on the 
right is the product of conjugates of a. 

If we look at the ideal Ox = Za, @--- 6 ZO, then we can define the discrimi- 
nant as 


dx = det (a) > 


I<i,j<n 


To find a Z-basis for Ox, one would want to use a computer with a number theory 
package such as SAGE. We give a short table of examples at the end of our brief 
review of number theory. The divisors of dx are the primes p in Z such that the ideal 
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px factors into prime ideals in K with squares of prime ideals appearing. This is 
Dedekind’s discriminant theorem (see Samuel [554, p. 75]). 

As we said, one wants to know if Ox is a UFD for many applications. To measure 
how far Dx deviates from unique factorization, Dedekind defined the ideal class 
group Zx of ideals modulo the equivalence relation 


A~ Bo yA=B, forsome y#OinK. 


Multiplication of ideals induces a group operation on Ig which makes J; a finite 
group whose order hx is called the class number of K. The ring Ox is a UFD if and 
only if hx = 1. We have only considered ideals which are subsets of Ox. But it is 
also useful to enlarge this definition. A fractional ideal 21 is a subset of K forming 
an abelian group under addition such that 0.2 C A and AA C Ox for some a Æ 0 
in Ox. The arithmetic of fractional ideals is very like that of the rational numbers Q. 

It is usually fairly hard to tell whether the class number of a given field is one. 
For imaginary quadratic fields Q(vd) the answer was obtained during the 1960s 
(see Stark [625]). The imaginary quadratic fields Q(/ d) with class number one 
have discriminant d = —3, —4, —7, —8, —11, —19, —43, —67, —163. Proofs 
of this statement involve analysis, using either Epstein zeta functions or modular 
forms. The theorem was suspected for a long time because it is easy to produce big 
tables of class numbers for those fields. We will show how to do this in Chap. 3. 
See Watkins [719] for the solution of the problem of finding all imaginary quadratic 
fields of class number < 100. 

For real quadratic fields, the situation is very different. Tables of class numbers 
lead one to believe that there should be an infinite number of real quadratic fields 
with class number one, but no one can prove it (see Stein [637, p. 81], for a table 
of real quadratic fields of class number 1, or Williams and Broere [742]). Masley 
and Montgomery (see [455]) found all the cyclotomic fields with class number one. 
Previously Siegel had reduced the problem of determining the prime cyclotomic 
fields of class number one to a finite problem (see Siegel [600, Vol. III, p. 442]). 

Note that there is a problem computing with ideals, since they are infinite sets of 
numbers. Thus one often wants to go up to a field E D K such that the ideals of K 
become principal in E. Here E may not have class number one but, in any case, one 
can do the desired computation more easily in E. Hecke had to do this many times 
(see Hecke [258, p. 255ff], for example). Class field theory shows that E exists (see 
Cassels and Frohlich [79, Exercise 3, pp. 355—357)). 

There is a more surprising invariant of a number field waiting to be described— 
the group of units Ux = {u c Oglu! ED x}. This group is not very interesting 
if K = Q, since it then consists of +1 and —1. It is also not very interesting for 
an imaginary quadratic field since if m is square-free and negative then Ug jm) = 
{+1, —1} unless m = —1 or —3. If m = —1, you get the fourth roots of unity. If 
m = —3, you get the sixth roots of unity. However, the real quadratic fields contain 
infinite cyclic groups of units as well as +1 and —1. A generator of the infinite 
cyclic group of units is called a fundamental unit for the real quadratic field. You 
can find a fundamental unit for QV d), when d > 0, using the continued fraction 
expansion for vd. 
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In general, Dirichlet’s unit theorem says that the group of units has the form 
Ux = (€1) X +++ x (&) x W, 


where (€;) denotes the infinite cyclic group generated by €;, W is the group of 
roots of unity in K (a finite cyclic group), r = rı + r2 — 1, with r; as defined at 
the very beginning of the discussion of algebraic number fields. The g; are called 
fundamental units of K. The measure of the unit group is given to a certain extent 
by the regulator defined by 


Rx = det (log 


i) [ej 
el) | ) 
1<i,j<r 


with 


Since the norm of a unit is +1, it does not matter which embedding K — C is left 
out in the definition of the regulator. 

Minkowski’s lemma in the geometry of numbers (see Volume II [667] or Samuel 
[554, p. 55]) gives the standard proof of the finiteness of the class number as well as 
Dirichlet’s unit theorem. One can also deduce these things from the convergence of 
Dedekind’s zeta function (see Theorem 1.4.2 and Siegel [600, Vol. II, pp. 93-94]). 

Note that if r = 1, the regulator is the logarithm of a fundamental unit. This is a 
transcendental number (see Baker [21, p. 6]), as was proved by Lindemann in 1882. 
Thus one might expect the same of the regulator itself, but no one knows how to 
prove that. 

The search for fundamental units is a problem that we can only begin to attack 
after we have considered the results of Volume II. The connection between algebraic 
number theory and that of the general linear group of n x n nonsingular real matrices 
is basic to much work on this subject (see, e.g., T. Shintani [593]). 

Conjectures of H. Stark allow one to construct units of class fields of number 
fields from values of derivatives of L-functions. The units are not fundamental, in 
general, but the conjecture allows one to find the units explicitly in many interesting 
cases. This also relates to the 12th in Hilbert’s famous list of mathematical problems 
for the turn of the last millennium (see Stark [626,627] or C. Popescu, K. Rubin 
and Alice Silverberg [519]). 

One of the major problems of algebraic number theory is describing the growth 
of the product of the class number times the regulator as the discriminant goes to 
infinity. We shall see why one looks at the class number times the regulator after 
Theorem 1.4.2. In fact, it is hard to separate h and R with the available tools. The 
Brauer-Siegel theorem says that under certain hypotheses on the sequence of fields 
(e.g., fixed degree over Q), one has 


log(hR) ~ log|d|!/?__ as |d| > œ. 
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Table 1.4 Examples of number fields 


K DK dx Uk hg 
Q(vm) Z [vm] m or 4m {1,—1} = 1 for 
m<0 ; 
Sre ifm= m#z-1 only 9 
7 2,3(mod4) or —3 fields 
-free 
Q(/m) {1,—1} x (£) conjectured 
m>0 oi i maa find € by =1 for 
square = continued infinitely 
-free 2,3(mod4) fractions many fields 
1,—1}x (£) conjectured 
3) { ’ 
ae Z [$m] 3m find € by =1 for 
sometimes sometimes Voronoi’s infinitely 
-free ‘ 
algorithm many fields 
=1 for 
= +p”? r=% _] 
i i p ly 29 
2ri/m Z fezti/m fm= = on: 
Q (e ) fe ] ii — : nas distinct 
=p unction fields 


Thus, in particular, for imaginary quadratic fields (where R = 1), there can be only a 
finite number of fields with a given class number. Siegel did the quadratic version of 
the theorem. Brauer did the more general result using the theory of representations 
of finite groups and Artin Z-functions. The whole proof would be trivial if one knew 
that the Dedekind zeta function (defined below) has no real zeros near | (see Lang 
[388]). Unfortunately the behavior of the Dedekind zeta function near s = 1 is still 
a mystery. The possible zeros near | are called “Siegel zeros.” References for them 
include Purdy et al. [523], Stark [624], and Siegel [598]. See Odlyzko [503] for 
a survey of work on bounds on discriminants and estimates of class numbers and 
regulators. 

Table 1.4 gives some examples of number fields and their invariants. References 
for this table are Narkiewicz [491], Stark [624], Williams and Broere [742], 
Barrucand, Williams, and Baniuk [27], and Masley and Montgomery [455]. See 
Cohen [95] or Stein [637] for more information on computational algebraic number 
theory. 

The Dedekind zeta function of a number field K is defined to be 


= a= py 


ideals ACOK prime ideals $B 


for Res > 1. Unique factorization of ideals into prime ideals gives the Euler product 
above just as in Exercise 1.4.4 for the case K = Q. For o(s) is nothing other than 
Riemann’s zeta function. 

The convergence of Dedekind’s zeta function can be proved algebraically using a 
fact from algebraic number theory. One needs to know that if p is a prime in Z, then 
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pOx =P- PE, with prime ideals P; in Ox, 


such that NB; = pf. Now p” = N(p) = p**' and it follows that 
g 
n= 5 fiei. 
i=l 


It is also clear that if $8 is a prime ideal in K then POZ = (p), where p is a prime 
number in Z. This shows (using the Euler product for x) that for s > 1 


log éx(s) = X, X KNP <n), Y Kp“ = nlog¢ (s). 


PI p kel 
So we have 
Cx(s) < (C(s))” when s>1. (1.26) 


In 1916, Hecke figured out how to continue Cx(s) to the whole complex s-plane 
as a meromorphic function with a simple pole at s = 1 and residue 


Re s Cx(s) = 2" (27) hRw7! |d| 72, 
s= 


where 7,12 are the numbers of real conjugates and pairs of complex conjugates 
of K,respectively, h is the class number, R the regulator, d the discriminant, and 
w the number of roots of unity in K. This clearly relates class number-regulator 
problems to the Dedekind zeta function. Moreover, Hecke’s proof can easily be 
seen to connect these problems with the behavior of the Dedekind zeta function in 
the interval (0,1). 

Landau used Hecke’s result (plus Hadamard factorization and Cauchy’s integral 
theorem) to prove the prime ideal theorem: 


# {prime ideals P in K | NB < x} ~x/logx as x > æ. 


This generalizes the ordinary prime number theorem when K = Q. There is a proof 
in Goldstein [223]. See also Lagarias and Odlyzko [383]. 

We want to relate g(s) and Epstein’s zeta function Z(Y,s) using a method 
devised by Hecke [258, pp. 198-207]. This method embodies the essence of the 
relation between the general linear group and algebraic number fields. It will tell us 
all the properties of the Dedekind zeta function. 


Theorem 1.4.2 (Hecke’s Relation Between Dedekind and Epstein Zeta Func- 
tions). Let £1, ..., & be a system of fundamental units for the algebraic number 
field K. For x € R", set 


F ` 
Tj =Tle? 2", j=l,2,...,n. 
i=l 


1.4 Mellin Transforms, Epstein and Dedekind Zeta Functions 77 


Let C be an ideal class in the ideal class group Ix of K. For any ideal 2 in C, let 
A= Zo © Za) E- -P Z@,. Then define the matrix Y(C,x) € Pa by 


tg road ere A ; 
Gel (onia 
using the notation Y{A} = ‘AYA. If €x(s) is Dedekind’s zeta function, set 
Ax(s) = (27? dr"? a"? 15/2)" Ts)" Ex(s). 
Here the degree is n = |K : Q|, and r; is the number of real conjugate fields of K, ro 
the number of pairs of complex conjugate fields of K. Let Z(Y,s) be Epstein’s zeta 
function and 
A(Y,s) = n °T(s)Z(¥,5). 


We also use the notation Y? = \Y|—'/"Y for Y in Pp. Clearly |\Y°| = 1. Then Hecke’s 


integral formula is 
f A (Y (Ca), Z) ax. 
xe[0,1]" 2 


Here w is the number of roots of unity in K andR is the regulator of K. 


w 
a pAkc(s)= ¥ 
2"1nR Calg 


Proof. Step 1. The switch from a sum over ideals to a sum over elements of 
ideals. Recall that Zg is the ideal class group. Let C be an ideal class in Ix. Choose 
an ideal % in the inverse class C~!. Then AB = a@Ox = (q) for all ideals B in 
C. Thus 


Cx(s)= Ne $ Ma)”, 


Celk a€(A—0) /UK 


where Ux is the group of units in Ox and the quotient (X — 0)/Uxg means take 
representatives for the equivalence relation œ ~ ß if and only if œ = Bu for 
some u in Ux. Here we have used the multiplicativity of the norm and the fact 
that (B) = (œ) if and only if $ = au, for some u in Ux. 

Step 2. Products of T -functions as products of Mellin transforms. The Mellin 
transform formula for the gamma function is 


a “T(s) = Te yl exp(—ay)dy, fora>0, Res>0. (1.27) 
R 


Taking products of such integrals, we obtain 


78 
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(27d! a2) T(s/2 TC) (NA/N(a)) 


! —I/n m d 
-| Ny’! exp (—7 (|dx| N2) U Tr (@ya)) LA 
yE(R+)" +2 y 
where 
retro retro d ntn dy; 
, y y 
Ny= [[ x, Try= ¥ ep, — = [[ #, 
j=l j=l j=1 Yj 
and 


na 1, for j= 1,...,r1, 
J | 2, for j=ri+1,...,r1 +72. 


The definitions are set up to make 


ry+r2 


Ny= [] le" =N(@), y= (Oa), 
j=l 


for example. When we write @ya we mean the element of (Rt)(ri1+72) given by 
the vector with jth component |a/) |?y;. 

Step 3. Hecke’s change of variables. Hecke decided to send (y1, ..., yr;+r,) to 
(u,x1, ...,X;) via the equations 


yjoutj, jHl,...,m+n. 


Here the T; are defined in the statement of the theorem as products of powers of 
conjugates of fundamental units. If the fundamental units did not exist, then we 
would be able to deduce a contradiction to the convergence of Dedekind’s zeta 
function. Exercise 1.4.14 below is needed to perform the indicated substitution 
in the integral of step 2. Thus we obtain 


(2772 |dx|!/2a-"/?)* T(s/2)"T (s) (NA/N(&œ)) 


=n2" IR fregr Iusot t exp (nw (drn) 


Tr(@rer) J du dx. 
Here T = (T1, .--, Tn). 


Step 4. Perform the integral over u and switch the quotient modulo units to 
the domain of the integral over x. You obtain 


(2-7 Iden") r T 


= n2 Rw aÌT (ns/2)|dx|*!? F | Z(¥(C,x),ns/2) dx. 
Célg x€[0 1)’ 


1.4 Mellin Transforms, Epstein and Dedekind Zeta Functions 79 


To see this, note that the Dirichlet unit theorem says that if u € Ux then u = 
Kej! --- E% for some a; € Z where K is a root of unity in K. This decomposition 


is unique. Thus 


j= ja 


This completes the proof, once you do the following pair of exercises. 


Exercise 1.4.14. Show that the Jacobian of Hecke’s change of variables in step 3 
above is 


=y] vege (2 AR. 


dy 
o (x,u) 


Exercise 1.4.15. Verify that Y (C, x) is actually a positive definite symmetric matrix. 


Hint. Suppose that a in has the form & = Zajo; for a = '(a;, ..., an) in Z”. 
Then Y(C,x) = Tr(@ta). This gives a positive real number even if you plug in a 
from R”. To see that the matrix Y(C,x) is real and symmetric, you need to look at 
the permutation 


z= lo r rı+1 rtm rn+trn++l +> n 
OCU ooe r nm tmtl -e n r+ orp tre 


induced by complex conjugation on the conjugates of K. Then Tj) = Tj. 


Note that the value of the Epstein zeta function in Hecke’s integral formula is 
independent of the choice of ideal in C and integral basis of that ideal. It is also 
independent of the particular set of fundamental units that we choose. In particular, 
|Y(C,x)| = N2l?|dx|, since the integer matrix relating a Z-basis of 2% and a Z- 
basis of Dx can be diagonalized by elementary divisor theory using elementary row 
and column operations. Changing the ideal in C which is used replaces the matrix 
Y(C,x) by Y(C,x)[A] = ‘AY(C,x)A, for some A in GL(n,Q). Since we are taking 
determinants to be one in the integral formula, we are transforming by GL(n,Z), 
which leaves the Epstein zeta function invariant. A similar argument applies to 
changing the Z-basis of the ideal. 

If the field K is imaginary quadratic, the formula in Theorem 1.4.2 is much 
simpler, since then there are no integrals. Then it is in fact a very old formula, 
expressing the connection between ideals in imaginary quadratic fields and binary 
quadratic forms. We will see in Chap.3 that this leads to an algorithm for the 
computation of class numbers of imaginary quadratic fields. This explains how 
Gauss could make class number conjectures before class numbers were invented. 
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He was talking about class numbers of binary quadratic forms and conjectured that 
in the positive definite case there are only finitely many discriminants with a given 
class number. 


Corollary 1.4.1 (Properties of the Dedekind Zeta Function). 


(1) Cx(s) can be continued to a meromorphic function of s with a simple pole at 
s = l having residue 


Re s Cx(s) =2" (2)hRw !|d|~!/? 
s= 
and Cx(s) satisfies the functional equation 


A(s) = (2a 2r) "T(s/2)"T(s)" Cx (8) = Ax(1—3). 
(2) The Dedekind zeta function has the incomplete gamma expansion: 


ri noni! 
aot — i p J (c 2 Y (C,x)"la ! 


aezg x€[0,1)" 
+6 (“oS s) ¥(C.4)a) ) dx. 


Proof. The corollary follows from Theorems 1.4.1 and 1.4.2, using the fact that 
Y(C,x)~! = Y(C', —x), where C’ denotes the ideal class containing the ideal 21’ 
dual to the ideal X in C that we had selected. If X% is an ideal the dual ideal W is 
defined to be 


Ax(s) = 


= {P €K|Tr(a@B) € Z for all a € A}. 


See Lang [388, pp. 57-58], where it is shown that 2’ is a fractional ideal in general. 
You need to see that if œ’ is a dual basis for the Z-basis œ of the ideal i.e., if 


Tr (@;@;) = 5; = the Kronecker delta, 


a\ —1 F 
then Qi 1) =" (o G !) and q’ is a Z-basis for the dual ideal to that generated 
by @. 


The most important dual ideal is that for Ox itself. The different is defined to be 
dx = (Of) ~!. And Ndx = |dx|. The concept of dual ideal is really the same as the 
concept of dual lattice or dual subgroup of a locally compact abelian group. Poisson 
summation will always relate the sum over a lattice and that over the dual lattice. In 
the special case under consideration here our lattice is an ideal in Ox. We can view 
this as a subset of R” using the conjugations. More generally, a subset L of R” is 
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a lattice if L is a discrete subgroup of R” such that R”/L has finite volume. More 
information on Euclidean lattices can be found in Siegel [597]. The dual lattice L* 
to a lattice L in R” consists of the vectors u such that ‘wv is in Z for all v in L. For 
example, ifn = 1, L = aZ, a #4 0,and L* = YA See also Weil [726, Chap. 2]. When 
one replaces lattices by positive definite matrices, the dual lattice corresponds to the 
inverse matrix. 

If the class number were infinite or if the r fundamental units did not exist, then 
one could show that €x(s) is infinite, contradicting the bound that we found when 
Re s > 1 in formula (1.26). This point of view is developed in Siegel [600, Vol. II, 
pp. 93-94], where it is noted that Dirichlet discovered his unit theorem “als er im 
Jahre 1846 in Rom der Ostermusik in der Sixtinischen Kapelle zuhörte.” 

We see from the corollary that Cx(s) is negative when s is less than, but 
sufficiently close to 1. It might be expected that the Dedekind zeta function would 
behave like Riemann’s and thus its only possible zero in (0,1) would be at the point 
$, but no one has been able to prove this, even for quadratic fields (cf. Stark [624], 
Purdy et al. [523], Stark and Zagier [634]). In fact, one expects that any proof 
that works for the Riemann zeta function should be generalizable to the Dedekind 
zeta function. For example, Siegel says (see [600, Vol. II, p. 72]), “Sollte einmal 
ein Beweis fiir die Richtigkeit der Riemannschen Vermutung gelingen, so wird 
dieser vermutlich fiir alle, auch die von Hecke verallgemeinerten ¢-Funktionen 
gültig sein.” However, although this statement appears true for the case of Landau’s 
prime ideal theorem, it does not appear to work for real zeros of the Dedekind zeta 
function. For it is easy to show that the Riemann zeta function has no zeros in (0, 1) 
(cf. Exercise 1.4.10). But no one knows how to generalize this to Dedekind’s zeta 
function (i.e., there may exist “Siegel zeros”). 

Of course one still does not know whether the Riemann hypothesis holds for 
the Dedekind (or Riemann) zeta functions. Most computer work has been done for 
Riemann’s zeta function, but see Lagarias and Odlyzko [383] and the website www. 
Imfdb.org, which is a database for L-functions, modular forms, and related objects. 

We are interested in the sign of g(s) in (0,1) thanks to the Brauer—Siegel 
theorem. For the incomplete gamma expansion in the corollary allows one to say 
that Cx(s) < 0 implies 


2"\w!hR i . . 
reo > asum of integrals of incomplete gamma functions. 
The best way to analyze the terms on the right-side of this inequality seems to 
involve rewriting the terms as higher-dimensional incomplete gamma functions. 
If we can get good lower bounds on these integrals, then we get good lower bounds 
on the product of the class number and the regulator. Upper bounds come from 
(1.26), for example. 

It is tempting to try to use Hecke’s integral formula (Theorem 1.4.2) to deduce 
the behavior of Cx(s) for s in (0,1) from the behavior of Z(Y,s) for s in (0,n/2). 
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But we know from Exercises 1.4.11 and 1.4.13 that the behavior of the Epstein zeta 
function is very different from the expected behavior of Dedekind’s zeta function. 
One can also interpret Hecke’s integral formula (Theorem 1.4.2) as the compu- 
tation of the Oth Fourier coefficient of the function A, (Y(C,x),ns/2), considered as 
a periodic function of x. Siegel computes all the Fourier coefficients for K a real 
quadratic field in [599]. One obtains Hecke L-functions with grossencharacters y: 


L(s, x) = X LQNA. 


For a real quadratic field of class number one and fundamental unit €, an example 
of a Hecke grossencharacter is 


x((a)) =(a/oly"/8*, for æ in Ox. 


Here a’ means the conjugate of œ in K. For more information on Hecke grossen- 
characters, see Hecke [258, pp. 215-234, 249-287]. 

If K is imaginary quadratic, there is no integral in Hecke’s integral formula. 
Then the result analogous to that of Siegel gives a connection between nonanalytic 
Eisenstein series and Hecke L-functions with grossencharacters. An example of a 
grossencharacter y for an imaginary quadratic field of class number one is: 


x((a)) =a T, with e+ f even. 


In 1970 Damerell found L(n, %), for large n in Z*, to be an algebraic number times 
a certain explicit transcendental number (see Weil [727]). 

You might ask why one cares about values of L-functions at positive integers. 
The earliest result of this type was that of Euler, who showed that for Riemann’s 
zeta function (see Exercise 3.5.7 of Sect. 3.5), 


(2m) Bom 
2(2m)! 


¢(2m) = (-1)""! ,m=1,2,3, .... 


Here B, € Q is the nth Bernoulli number defined by 


n 
x X 
e*—1 T È Bazy 

n>0 ý 


Analogous results, using similar methods, were obtained by Shintani [593, 595] 
for the Dedekind zeta function of a totally real field. Interpretations involving p- 
adic interpolation of these values have been much discussed (see Coates [94] or 
Iwasawa [321]). There are also some very general conjectures of Lichtenbaum 
giving K-theoretic meaning to values of L-functions at negative integers (see Borel 
[51], Lichtenbaum [420], Serre [578]). 

Values of L-functions are of interest also because of their connection with 
Hilbert’s 12th problem which asks for an explicit construction of class fields E or 
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abelian extensions of number fields K. Here “abelian” means the Galois group of 
E/K is commutative or abelian. When K = Q, a theorem of Kronecker and Weber 
says that a finite extension E of Q with abelian Galois group must lie in a cyclotomic 
field E C Q(exp (27i/m)). When the base field is imaginary quadratic rather than 
Q, one adjoins values of modular functions (see Shimura [589], Borel and Chowla 
[53]). When K is arbitrary, Shimura has obtained results of this sort using algebraic 
geometry. Another point of view is expressed in Stark [626]. Stark’s idea is that 
values of L-functions can be used to find units which generate class fields and 
express the reciprocity laws of class field theory. He has proved this in classical 
cases and has convincing computer examples in other cases (e.g., the base field real 
quadratic or cubic). See also Popescu et al. [502]. 


1.4.4 Crystallography 


A three-dimensional lattice is the set of all linear combinations with integer 
coefficients of three linearly independent vectors vı, v2,v3 in RÌ. If you choose the 
standard basis vectors of RÌ, your lattice is Z°. If you think of atoms as points with 
legs, then by connecting atoms you can build up a crystal lattice such as that of salt 
(NaCl) pictured along with various other lattices in Figs. 1.14 and 1.15 from Born 
and Huang [57, pp. 383-384]. 

A crystallographic point group G is a subgroup of the group O(3) of rotations 
of 3-space which leave a lattice L invariant; i.e., for each v in L the vector gv 
is in L, for any g in G. Note that you can represent the elements of G by matrices 
with integer entries or by rotation matrices. Thus G is compact and discrete which 
means that G must be a finite group. There are 18 abstract point groups. If instead 
of isomorphism classes, you look at conjugacy classes in the group GL(3,R) of 
all nonsingular 3 x 3 real matrices, then you get 32 groups. See Table 1.5 for the 
abstract point groups. The groups C,, Dn, for n = 1,2,4 and T,O leave invariant 
the lattice Z?. The groups Cn, Dn, for n = 3,6 leave invariant a lattice with basis 
(1,0,0), (1/2, V3/2,0), (0,0, 1). The matrix —/ leaves all lattices invariant. More 
information on these things can be found in Birman [40], Hilbert and Cohn- 
Vossen [297], Janssen [326], Loeb [424], Lomont [427], Nussbaum [497], and 
Schwarzenberger [568]. 

The crystallographic space groups are discrete subgroups of the Euclidean 
group M(3,R) of motions of RÌ; i.e., the group generated by rotations and 
translations. The subgroup U of translations in the space group G is isomorphic 
to Z? and G is an extension of U by the finite point group K. There are 
219 nonisomorphic space groups. And there are 230 nonconjugate space groups. 
Schoenfliess, Federov, and Barlow worked this out around 1890, after Sohnke 
had listed the orientation-preserving space groups. See Schwarzenberger [568, 
pp. 132-133], for the fascinating history of this episode. It follows that there are only 
230 ways to form crystals (although recently quasicrystals have been discovered as 
we explain in Sect. 1.5.3). Crystallography is based on the study of such groups and 
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Fig. 1.14 Various crystal lattices: (a) simple cubic; (b) body-centered cubic; (c) face-centered 
cubic; (d) NaCl (From Born and Huang [57, pp. 383-384]. Reprinted by permission of Oxford 
University Press) 


their representations. Chemists know the character tables of these groups quite well. 
For the quantum mechanics of such a crystal will be ruled by the symmetry group 
of the crystal, and not the full group of rotations. This will become clearer after we 
discuss the Schrödinger equation in Sect. 2.1.3. 

But we do not want to discuss the representations of finite groups here. Nor do 
we want to discuss the quantum mechanics that can be derived from knowledge of 
the character tables of these finite groups leaving crystals invariant. The interested 
reader could consult some of the references above for these things. 
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Fig. 1.15 Various crystal 
lattices: (e) CsCl; 

(f) Diamond; (g) ZnS (From 
Born and Huang [57, 

pp. 383-384]. Reprinted by 
permission of Oxford 
University Press) 


Table 1.5 Abstract point groups 


Cyclic groups 
Cn = cyclic group with n elements, n = 1,2,3,4,6. 


Dihedral groups 

D,, = group proper rotations (i.e., det = 1) of a regular n — gon, n = 2,3,4,6. 
Tetrahedral group 

T = group of proper rotations of the regular tetrahedron 


Octahedral group 
O = group of proper rotations of the cube 


Point groups of the second kind 
add — I = —identity matrix to the group. Write C, = {/,—J} 
C4 x C2, Co x C2, D2 X C2, D4 x C2, D6 X C2, T X Co, OX Cy 


Instead we want to show how to use Theorem 1.4.1 to deal with some lattice 
sums that arise in physics. Given a crystal lattice Zv; 6 Zv2 ® Zv3, we can form a 
positive definite 3 x 3 matrix 


viv ‘Vivo ‘V1 V3 
Y = | tvv fvova ‘vov3 | = 1 [v1 v203]. 
"y3v1 ‘32 'V3V3 
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For any sum over the crystal involving powers of the distance of a vector from the 
origin we are really computing Epstein zeta functions, since 


w = QV +azv2 +.43V3 = (V1 V2V3) | a 


and ||w||? = ¥[al, if a = ‘(a,,a2,a3) is in Z3. Many sums of this sort are considered 
in Born and Huang [57, pp. 248, 385-390, 413], and Ziman [754, pp. 37-41], for 
example. 

Next let us consider the simplest example—the computation of the electrostatic 
energy of salt (NaCl). The size of this energy has an effect on the solubility of a salt 
in various solvents, as well as on the hardness and melting point of the crystal (see 
Hgjendal [298]). The potential energy of one ion with respect to all others in the 
lattice is 


vve? vve? 
t=- p 2E 
where ve, vie are the charges of the two kinds of ions, Lo is the length of an edge 
of the unit cell, L is the distance between two ions, and œ is Madelung’s constant. 
The sign +1 is determined according to charges of the two ions. 

A slight generalization of Theorem 1.4.1 leads to a simple way to compute 
Madelung’s constant (see Exercise 1.4.17). This method and variants, some of which 
use the Fourier expansion of Epstein’s zeta function in Chap. 3, have led to many 
papers in the crystallography literature (cf. the preceding references to Born and 
Huang or Ziman, plus Emersleben [161,162], Ewald [169], Sakamoto [553], and 
Zucker [756], or the multitudes of papers cited by these authors). The series involved 
in the computation of Madelung’s constant for NaCl (salt) is a difference of two 
Epstein zeta functions evaluated at s = 7 Since Epstein’s zeta function of a 3 x 3 


matrix has a pole at s = 3, the Dirichlet series does not converge at s = 5. Thus the 
series expressing Madelung’s constant does not converge absolutely, and a certain 
amount of care must be taken. 

Hgjendal [298] invented a method that just sums the terms of the original series 
in a straightforward way. He says that he finds Ewald’s method (see Exercise 1.4.17) 
“still less understandable to persons not acquainted with the highest of mathematics” 
than Madelung’s which involves the Fourier series for Epstein’s zeta function (see 
Chap. 3). 


Exercise 1.4.16 (Hgjendal’s Method for Evaluating Madelung’s Constant 
for Salt (NaCl)). Compute œ defined above for the NaCl crystal pictured in 
Figs. 1.14(d) and 1.16. To do this begin by computing $. (+)Lo/ZL for the eight ions 
in the smallest possible crystal pictured in Fig. 1.16. You should obtain 


3.93 1 
1 y2 y3 
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Fig. 1.16 NaCl lattice 


Then compute the sum over the next layer of six cubes surrounding this one 
completely. And continue in this way proceeding by layers similar “to a system 
of Chinese boxes.” We have included Hgjendal’s result so that you can check your 
program. He had no computer. The nth level has (2n)? — 1 ions to deal with. So 
his ninth division involves summing 8000 terms. Hopefully the reader will use a 
computer. Our result from the table below is very close to the values found by 
Madelung and be Emersleben, who obtained 1.747557. 


Ferra 3_ 3 La 
1 ae : i — 47 + %3 = rll 
2 division: I K A va T% 
RR + D Via 0.295739 
3 division: = —0.004729 
4 division: = 0.000679 
5 division: = —0.000221 
Total 1.747498 + 0.0005 
6 division: = 0.000100 
7 division: = —0.000059 
8&8 division: = 0.000040 
9 division: = —0.000028 
estimated for further divisions = 0.000007 
Total 0.000060 
added to the above 1.747498 
we obtain as our most accurate value 1.747558 


Exercise 1.4.17 (Ewald’s Method or the Method of Theta Functions). 


(a) Show that Madelung’s constant for NaCl is 


$, exp(ri(nı +m +n3))I n, 


n€Z3—0 


I = identity matrix. 


This is a special case of Epstein’s most general zeta function considered in 
Epstein [163], for example. The general Epstein zeta function is 


88 1 Flat Space: Fourier Analysis on R” 


Z(Y,g,h,s)= X, Ylatg)“exp(2zi‘ha), 
aeZ"-0 


if Re s > n/2, Y € Pa, gh E R”. 
(b) Define the general theta function by 


O(Y,g,h,t)= X, exp(—atY[a+g]+2zi‘ha) 
acZ"—0 


for Y € Pa, t > 0, g, h € R”. Show that theta satisfies the transformation 
formula 


O(Y,g,h,t) =t" Y" exp(—2mi'gh)0(¥—',h,—g,t7'). 


(c) Imitate the proof of Theorem 1.4.1 to show that Epstein’s general zeta function 
satisfies the functional equation 


A(Y,g,h,s) = n °T(s)Z(Y,g,h,s) 
= |Y¥|7!/ exp(—2ni'gh)A(Y~!,h, -—g,n/2—s). 


Also obtain an expansion of Z(Y,g,h,s) in incomplete gamma functions 
analogous to the expansion obtained in Theorem 1.4.1. 

(d) Use the result of (c) to write a computer program to compute Madelung’s 
constant for NaCl. Compare the results with those of Exercise 1.4.16. 


There are other applications of Epstein zeta functions. For example, this function 
is the simplest case of the zeta functions studied in Minakshisundaram and Pleijel 
[470] in order to obtain information about the eigenvalues of the Laplacian on 
compact differentiable manifolds (see also Singer [603]). And minima of Epstein 
zeta functions for fixed s have been investigated in order to find the best lattice to use 
in numerical integration among other things (see Delone and Ryskov [128], Fields 
[179], as well as Sarnak and Strömbergsson [561]). Applications of Epstein zeta 
functions to quantum-statistical mechanics are considered by Hurt and Hermann 
[310, Chap. 8]. See also Kirsten and Williams [354]. 


1.5 Finite Symmetric Spaces, Wavelets, Quasicrystals, 
Weyl’s Criterion 


In this section we consider a few topics that were not quite ripe for discussion in 
1985— finite analogues of symmetric spaces, wavelets, and quasicrystals, plus a 
topic of an earlier exercise (Weyl’s criterion). 
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1.5.1 Symmetric Spaces and Their Finite Analogues 


Applied mathematicians regularly approximate the continuous with the finite. But it 
might be argued that one should even start with the finite. D. Greenspan [231] gives 
many examples and says: 


“Tt is unfortunate that so many scientists have been conditioned to believe that, say, 10° 
particles can always be approximated well by an infinite number of points. For, indeed, to 
approximate a physical particle by a mathematical point is to neglect the rich structures 
of the atom and the molecule, while to approximate 10*° objects of any type by an infinite 
number of such objects is to have enlarged the given set by so great an amount that the given 
objects are entirely negligible in the enlarged set, or, more precisely, in any nondegenerate 
interval of real numbers, 10°° points form a set of measure zero.” 


The idea of replacing the continuous with the finite occurred to physicists in 
the 1950s. Quantum mechanics says the state function @ for a quantum-mechanical 
system at energy level E satisfies the Schrödinger equation Lọ = E@, where L is the 
Hamiltonian operator. See formulas (1.19) and (2.14), for example. Eugene Wigner 
had the idea in the 1950s of modelling this with a finite matrix eigenvalue problem. 
He produced a histogram for the eigenvalues of 197 random symmetric real 20 x 20 
matrices (with normally distributed entries). See Wigner [739]. The result was that 
the histogram looked like a semi-circle. Thus was random matrix theory born. We 
say more about this in our second volume on higher rank and general symmetric 
spaces [667]. 

Faced with the task of creating finite analogues of symmetric spaces, you may 
wonder how this can be done. It is possible that you are asking yourself: What is 
a symmetric space? You have seen one example: Euclidean space R”. In the next 
chapter you will investigate another example—the sphere. Chapter 3 concerns a 
third example the non-Euclidean Poincaré upper half-plane. We have been using 
the following definition of a symmetric space X. The definition says that X is a 
Riemannian manifold (roughly, a locally Euclidean space with a notion of distance) 
having a geodesic-reversing isometry at each point; i.e., for each x € X there is 
a map © : X — X, preserving the distance between points, fixing x and reversing 
geodesics through x. Here we are considering noncompact symmetric spaces, except 
for spheres. We speak of spheres and a little more about symmetric spaces in 
Chap. 2. We say more about general symmetric spaces in our second volume [667]. 

To obtain finite analogues of symmetric spaces and to create more examples, it 
is helpful to define Gelfand pairs. A Gelfand pair (G,K) consists of a suitable real 
Lie group G (which is both a group and a manifold) having a suitable subgroup K 
such that the algebra L'(K\G/K) is a commutative algebra under the operation of 
convolution on G. If (G,K) is a Gelfand pair, then X = G/K gives an example of a 
symmetric space. We will say more about Lie groups in Chap. 2. 


Exercise 1.5.1 (Gelfand’s Criterion). Suppose that K is a subgroup of the finite 
group G and there exists a group isomorphism T : G — G such that s~! € Kt(s)K 
for all s € G. Show that then (G,K) is a Gelfand pair. 
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For the case of Chap. 1, we were thinking G = R”, under vector addition, and 
K = {0}, but you can also take G to be the Euclidean group and K the subgroup of 


rotation matrices. You can identify G with the group of matrices of the form ( : i ) i 
where k € O(m) = O(m, R); i.e., ‘kk = I and the column vector u is in R”. Elements 


g= e 7 € G act on column vectors x € R” by gx = kx + u. Here K is the 


subgroup fixing the origin; i.e., the subgroup of matrices e 2) ,k € O(m). Thus 


K can be identified with O(m). This group action leaves invariant the Euclidean 
distance between points. In Chap.2, G = O(3) and K is the subgroup fixing the 
north pole which can be identified with O(2). In Chap. 3, G = SL(2,R), the special 
linear group of 2 x 2 real matrices of determinant 1 and K is the subgroup of rotation 
matrices. 

In this computer age it is natural and useful to replace the continuous with some 
finite approximation. Thus we can replace the real line R or the circle R/Z with the 
finite circle Z/nZ (or with a finite field F4). This leads to the Fast Fourier Transform 
or FFT which has revolutionized signal processing. This idea had already occurred 
to Gauss in 1805 while computing the eccentricity of the orbit of the asteroid Juno. 
It was not until the paper of Cooley and Tukey in 1965 that the method began to be 
widely used. Now FFT algorithms are to be found in computer software packages 
such as Matlab and Mathematica. In this section we shall merely sketch the subject 
of finite symmetric spaces. References for finite symmetric spaces are Ceccherini- 
Silberstein et al. [81], Diaconis [132], and my book [668]. The book by Diaconis 
contains many applications in statistics. 

If we carry out this program further, we can replace the orthogonal group of 
Chap. 2, G = O (3) = O(3,R), with G = O(3,F,), where F, is a finite field with q 
elements. Or we can replace SL(2,R) in Chap. 3 with SL(2,F;), and then look for 
subgroups K with the property that (G, K) is a Gelfand pair. We will say more about 
this later. 

Now let’s consider a finite analogue of Euclidean space. Suppose that Fy is a 
finite field. This implies that q = p”, where p is a prime. Here we assume that p Æ 2. 
If our finite analogue of the real field R is F,, then our analogue of finite Euclidean 
space is F7’. Our finite Euclidean group G = M (2,IF,) is the group of matrices of 


the form K X 


0 r) , Where the column vector u is in Fẹ and k € O(m,F,); i.e., k is 


ku 
01 
on x € Fẹ by g(x) = kx+ u. Here K can be identified with the subgroup of matrices 


an m x m matrix with entries in F4 such that ‘kk = I. Elements g = ( ) € Gact 


( f : ) where k € O(m, F4). Thus we can identify K with O(m,F,). We have a finite 


field valued analogue of the Euclidean distance given by 


d(x,y) = (xı =y) ++ (Xm — Ym}, for x,y € Fy. 
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Fig. 1.17 The Euclidean graph £5(2,1) drawn by Mathematica with the command ShowGraph 
[GraphProduct [Cycle[5], Cycle[5]]] after loading the Combinatorica package 


Of course, this “distance” has values in F4 and thus is not a metric. The group action 
leaves the Euclidean distance between points invariant. 


Exercise 1.5.2. (a) Prove that d(gx,gy) = d(x,y), for all x,y € Fọ and all g = 
t *) acting via gx = kx + u, fork € O(m,Fq) and u € FẸ. 


(b) Show that if G=M(2,F,) and K = O(m,F,), as above, then (G, K) is a Gelfand 
pair and thus G/K is a finite symmetric space. 


Now a new idea occurs to us. One can draw a picture of Fy by attaching a graph 
to it. Suppose we fix a nonzero element a € F}. We define the Euclidean graph 
E,(m,a) to have vertices the elements of Fẹ and edges between vertices x,y € F7 
iff d(x,y) = a. This is what is called a regular graph as it has the same number of 
edges coming out of each vertex. This number of edges at each vertex in E,(m, a) is 
the order of the set 


Sq(a)={ueF? | d(u,0) =a}. (1.28) 


The number |Sq(a)| is called the degree of the graph E,(m,a). It can be computed 
in terms of the quadratic residue symbol. See formula (1.32) for the case that m is 
even and Terras [668] for the case m is odd. Figure 1.17 shows the graph in the case 
E5(2, 1). In this special case we get a torus graph—a product of two 5-cycles. 

The adjacency operator A on a finite graph X acts on functions f : X — C by 


ANE) = E FO); 


yrx 


where the sum is over all vertices y adjacent to x, meaning that there is an edge 
between y and x. We use the symbol y ~ x to mean y is adjacent to x. Taking the 
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basis of functions %a(x) = 1 for x = a and %a(x) = 0 otherwise, the matrix of A 
is the usual adjacency matrix. For the Euclidean graph E,(m,a), the adjacency 
operator is 


Af\xa= Z f= d fet), (1.29) 
yek? ueSq(a) 
d(y,x)=a 


where S,(a) is defined by formula (1.28). The combinatorial Laplacian of a finite 
regular graph X of degree d is defined to be A = A — dI , where J is the identity 
operator; i.e., If = f for all functions on the graph. The definition emphasizes 
the parallels between graphs and symmetric spaces. See my book [668] for more 
information. 

We can find a complete orthogonal set of eigenfunctions of the adjacency 
operator of the Euclidean graph E,(m,a). These are the exponentials appearing 
in Fourier analysis on Fg- To define them, we first need to define the trace of an 
element of F4. Suppose q = p”, where p is a prime. Since x ++ x? generates the 
Galois group of F; over F,, the trace of an element x € F4 is 


r—1 


Tr we, /B, (x) = Tr(x) = xP ph pee ty? 


This is the usual trace in field theory, mapping elements of the field F, to the 
subfield F,. It has the same properties as those of the trace on extensions of number 
fields. See formula (1.25) in the discussion above of algebraic number theory. 


Define the exponential 
2niTr ('b 
ats exp ( miTr ( 2) 
p 


for b,x € Fọ, thinking that b and x are column vectors. Then {e| beFg i? 
is a complete set of characters of FG . This is spelled out in the following 
exercises. These characters are one-dimensional representations of the group Fy 
under addition. We will say a little more about group representations in Chap. 2. 
See also [668]. 


Exercise 1.5.3. Prove that ep(x + y) = ep(x)ep (y), for all x,y € FẸ- 
Exercise 1.5.4. (1) Prove that Age, = Àa, bep, where the eigenvalue 


hee = >, elu). (1.30) 


u€Sq(a) 


(2) Then show that the e, form a complete orthogonal set of eigenfunctions for 
the adjacency operator A, in the inner product space L (F3) consisting of all 
functions f : Ff > C. 
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The eigenvalues A, , for a and b Æ 0 from formula (1.30) satisfy 


[Aap] < 207. (1.31) 


proved using André Weil’s estimate for Kloosterman sums. See formula (3.97) for 
a definition of a Kloosterman sum and [668, Chap. 5], for more details on why the 
eigenvalues of the Euclidean graphs are essentially Kloosterman sums. 

One can show that the degree of these graphs is 


ae when a#0 and m is even. (1.32) 


ISq(a)| =4""'—e,((-1)?) q 
Here €, denotes the quadratic residue symbol for the finite field F}. That means 


1,if b is a square in F4, 
&(b) = 0,if b=0, (1.33) 
—l,if b is a non—square in Fy. 


There is a similar formula for the degree when m is odd. See [668]. 

A question raised by computer scientists and others about connected finite regular 
graphs of degree d is: Do the eigenvalues of the adjacency operator satisfy the 
following bound? 


max{ |A| | A € spec(A), |A| 4d} <2Vd—1. (1.34) 


Here spec(A) means the spectrum of A; i.e., the set of all eigenvalues of A. 
Connected d-regular graphs satisfying this bound are called Ramanujan graphs. 
Infinite sequences of examples of fixed degree with number of vertices approaching 
co were given by Lubotzky, Phillips, and Sarnak [434], at least for certain degrees. 
See also the books of Lubotzky [433], and Sarnak [555]. Ramanujan graphs are 
of interest to computer scientists because they lead to efficient communication 
networks. The rapid convergence of a random walk on the graph to uniform is 
behind this fact (at least if the graph is not bipartite). See [668] or Hoory, Linial, 
and Wigderson [302]. There is an interesting interpretation of the fact that a graph 
is Ramanujan. It is equivalent to the statement that the zeta function of the graph 
satisfies the Riemann hypothesis. See [671]. The connection with the conjecture of 
Ramanujan to be stated in Chap. 3 was the reason for the name of the graphs. 
Given the preceding formulas, for an odd prime p, and nonzero a € Fp, one can 
see that E,(2,a) is a Ramanujan graph when p = 3(mod4). Here we use the fact 


that €,(—1) = Cp. as proved in most elementary number theory books. Thus 
when p = 3(mod4), |Sp(2,a)| = p + 1, for a not divisible by p. The Ramanujan 
bound is then 2,/p and it holds by formula (1.31). 


Exercise 1.5.5. What about the Ramanujanicity of E,(2,1) for p = 1(mod4)? 
Check it out using a computer for the first 100 primes. 
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Next let’s say a bit more about the FFT. The beginning of the algorithm is to 
consider the Fourier transform on Z/nZ, when n = rs. If f : Z/nZ —> C, the Fourier 
transform of f is 


f= F foet”, for xe Z/nZ. 


yeZ/nZ 


It takes n? multiplications to compute this transform. But when n = rs we can write 
the transform as a double sum. For we can write x = x2r + x1, with x; € Z/rZ and 
x2 € Z/sZ and y = yis + yo, with yı € Z/rZ and y2 € Z/sZ. This gives setting 


—2ni 
Ww—en 


f= x ( £ fows awe) wert), 


y2EZ/sZ \y1EZ/rZ 


The inner sum is a Fourier transform over Z/rZ. The outer one is a Fourier 
transform over Z /sZ of the inner one times w™™2, The computation now takes sr? + 
rs? operations. This is n(r +s) which is a savings on n(rs). Repeating this idea when 
n is a power of 2, leads to the original Cooley—Tukey FFT. The FFTs have certainly 
revolutionized may things from weather prediction to medical imaging. See Terras 
[668] for more information and references. Other FFTs are discussed by Maslen 
and Rockmore in [454]. Driscoll and Healy [140] manage to obtain an FFT for the 
sphere. I am still wondering if there is a way to approximate the Fourier transform 
on the sphere with that on a quotient of finite orthogonal groups. 


1.5.2 Wavelets 


Fourier series expand functions as series of complex exponentials or sines and 
cosines. These functions vanish only at a discrete set of points. Wavelets provide 
a new approach. The function is expanded in a series of dilations and translations of 
a “mother wavelet” y. The mother wavelet is chosen to have compact support and to 
be differentiable a finite number of times. In 1988, Ingrid Daubechies showed how 
to construct such functions (called Daubechies wavelets). See Daubechies [117]. 
Figure 1.18 shows an example of a Daubechies wavelet as plotted by Mathematica. 

There is no simple formula for the Daubechies wavelet function. We will describe 
how to construct it using the scaling function @. See Fig. 1.19 for the scaling 
function corresponding to the wavelet in Fig. 1.18. Now wavelets are used for 
example in fingerprint analysis, for the JPEG2000 image compression algorithms, 
in voice recognition software. Here we give only a very brief introduction. More 
information can be found in Benedetto and Frazier [35], Bratteli and Jorgensen 
[63], Cipra [90], Ingrid Daubechies [117,118], Ingrid Daubechies and Lagarias 
[119], Strang [640], Strang and Nguyen [642], and Strichartz [643]. 
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Fig. 1.18 The Daubechies wavelet D4 plotted by Mathematica with the command 
Plot [WaveletPsi [DaubechiesWavelet [4], x], {x, -3, 4}, PlotRange 
-> All, PlotStyle -> {Red, Thick}] 


Fig. 1.19 The scaling function for the Daubechies wavelet D4 plotted by Mathematica 
with the command Plot [WaveletPhi [DaubechiesWavelet [4], x], {x, 0, 7}, 
PlotRange -> All, PlotStyle -> {Red, Thick}] 
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The scaling function must satisfy a scaling identity or dilation equation 


N 
p(x) = >) axp(2x—k), (1.35) 
k=0 
where the coefficients a, are chosen very carefully. The number of terms in the sum 
(N + 1) needs to be roughly five times the number of derivatives that you want y 
to have. Restricting @ to the integers turns the scaling identity into an eigenvalue 
equation for a finite matrix. Once you know ọ on Z, the scaling identity determines 
it on 5Z. Then use the same method to find @ on iZ. Continue inductively to find 
Ọ on 2”Z. Then continuity determines @ everywhere. You get the wavelet y from 
the wavelet formula 


N 
y(x) = ¥ (-1)ay-E(2x— k). (1.36) 


The scaling identity implies that we can compute the Fourier transform of the 
scaling function. We get 


so =r EE) (1.37) 


where p(y) is the trigonometric polynomial 


12 
Py) == ¥ ae ™. 
2 k=0 


Exercise 1.5.6. Prove formula (1.37). 


One should choose the coefficients a, such that p(0) = 1. Then we find that, 
upon repeating the process which gave us the formula for @ and using mathematical 
induction, we have a formula for the scaling function 


The recipe for choosing the coefficients comes from Ingrid Daubechies [117, 118] 
(see also Benedetto and Frazier [35, p. 38], or Ingrid Daubechies and Lagarias 
[119]). Once we have the formula for 6, we can use formula (1.36) to obtain the 
wavelet’s Fourier transform: 
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Next create the scaled and dilated functions 


2i!2y (2ix— K) = yal). 
One then hopes to represent a function f supported in [0,1] by the series 


YY (f, Wik) Wiel). Here the inner product is the usual one for L7((0,1]). 
j=0k=—co 
There is a method called subband coding that allows the computation of these inner 
products using just the scaling function coefficients ag. See J. J. Benedetto and M. 
W. Frazier [35, p. 46]. 

I gave an introduction to finite analogs of wavelets and connections with the 
representations of the finite affine group in [668]. Both Mathematica (Version 8) 
and Matlab (with the wavelet toolbox) will compute with wavelets. 


1.5.3 Quasicrystals 


In 2011, Daniel Shechtman won the Nobel prize in chemistry for discovering 
quasicrystals. He had discovered this structure in 1982 when he chilled a molten 
mixture of aluminum and manganese. The diffraction pattern of the material was 
surprising as it showed that the material had a tenfold symmetry. Such a symmetry 
can be shown to be impossible for classical crystals. See Marjorie Senechal’s book 
[575, p. 6]. Thus in 1992 the International Union of Crystallography changed its 
definition of crystal. Now it says a crystal is a solid with a “discrete diffraction 
diagram.” The classical definition had required the crystal to have translational 
symmetry. This will be lacking in quasicrystals. Hundreds of quasicrystals have 
been created in laboratories. Finally, in 2009, natural quasicrystals, not made in a 
laboratory, were found. 

Even earlier than all this (in 1974) Roger Penrose found nonperiodic tilings 
of the plane. Ultimately it was shown that these can be made with two tiles. See 
Gardner [198]. Earlier work had made use of many more tiles. The Penrose tilings 
were constructed by de Bruijn using a projection from a 5-dimensional lattice. 
Recently Farris has found a way to visualize such things using sums of projections 
of eigenspaces of the cyclic permutation matrix in five dimensions. See Farris 
[175]. [have attempted to imitate this construction in Fig. 1.20. We used the Matlab 
programs below. The first gives a function associated to the projection of 5-space 
onto a plane eigenspace of the 5 x 5 cyclic permutation matrix. The second gives a 
density plot of the function. Note the appearance of the Golden Ratio (1 + V5) 12: 
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Matlab Function giving Projection of 5-Spaces onto the Plane 


function [|v] = farris5(x,y) 

phi = (1+sqrt(5))/2; 

v =sin(2*pixx/phi)+sin(2* pix (x/phi’2+y/phi)) 
+sin(2* pix (—x/phi*2+y/phi*2)) 

+sin(2* pix (—x/phi—y/phi*2)) + sin(2* pi « (—y/phi)); 


Matlab Commands to Plot the Preceding Function 


% create matrix of farris5 values and density plot of it 
clear all 

figure 

n= input('n =’); 

fori=1:50*n, 

for j =1:50*n, 

matfarris5(i,j)=farris5(i/n, j/n); 

end 

end 

pcolor(matfarris5) 


Exercise 1.5.7. Create a function with a threefold symmetry in a similar way to 
that used to produce Fig. 1.20. See Farris [175]. 


So what does harmonic analysis say about all this? We take our discussion 
from Senechal [575] and Strichartz [643]. See these references for more details. 
First identify the diffraction diagram as the Fourier transform of the distribution 
obtained by putting delta functions at the nodes of the quasicrystal. Now we ask: 
How can we construct a quasicrystal, i.e., a set of nodes such that if we look at the 
distribution defined by putting delta functions at each node, the result will have a 
Fourier transform supported on a discrete set? 

The idea comes from de Bruijn’s method of creating Penrose aperiodic tilings 
of the plane (see [123]). Let’s look at the simplest case. Consider the one- 
dimensional lattice Z? in the plane and rotate it by an angle whose tangent is 

cos@ sin@ f 
sing =) and tan @ is 
irrational. Then take a strip |y| < b parallel to the x-axis. Now project the lattice 
points lying in this strip to the x-axis. That gives an aperiodic tiling of the real line. 
See Fig. 1.21 for an example known as the Fibonacci tiling of the real line built 


irrational. Obtain the lattice A = AZ*, where A = ( 


using the golden ratio t = Ee The red lattice points in Fig. 1.21 have the form 


((m +tn)/V1+1t2,(n—tm)/V1 +?) , for m,n € Z. Those in the horizontal strip 
of width 4 about the x-axis (marked by green lines in Fig. 1.21) are then projected 
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Fig. 1.20 A Matlab plot of the function farris5(x,y) defined above 


onto the blue points on the x-axis in Fig. 1.21. We leave it to the reader (and perhaps 
Google) to uncover the connection between the golden ratio and the Fibonacci 
numbers. 

The distribution corresponding to the set of points 


n= { (n,m) | n € A =AZ?, |n| <b} 
is 


Lt)= X  ôlt-m)tER. 
n=(n1,n2)EA 
|ng|<b 
Why should this have a discrete Fourier transform? If we were to eliminate the 
restriction on the y-coordinate of the lattice points, in fact, the points nı would be 
uniformly distributed on the x-axis by the Weyl criterion for uniform distribution. 
It is an exercise using Exercise 1.3.3 of Sect. 1.3 to prove this. Or see the discussion 
of Kronecker’s theorem in the next subsection. 

Figure 1.22 shows what Fig. 1.21 would have looked like if we had projected all 
the lattice points onto the real line rather than restricting ourselves to those with 
y-coordinates between —2 and +2. The blue line looks continuous, though it cannot 
be. It can only be dense in the real line as it is countable. 


100 


Fig. 1.21 An aperiodic tiling 
of the real line is given by 
blue points. They are 
projected from the red lattice 
points that lie between the 
lines y = +2. This is known 
as the Fibonacci tiling 


Fig. 1.22 The blue points are 
obtained if you project “all” 
points in the lattice of 

Fig. 1.21 onto the real axis, 
rather than just the points in a 
strip of width 4 as in Fig. 1.21 
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The distributional Fourier transform of L is by e275 Tt is an exercise using 
ned 
|no|<b 

the Poisson summation formula to show that if A = AZ?, where A is a rotation 


matrix, i.e.,’A = A~!, we have 


eo 2mins — x 
n=(n;,;n2)€A ned 
|n2|<b 


sin(27bnz) 
Tn 


ô(s— nı). 


This is a weighted sum of delta functions and thus is supported on a discrete set. 


1.5.4 Weyľs Criterion for Uniform Distribution 


In Exercise 1.3.3 of Sect. 1.3 we looked at a special case of a criterion due to H. 
Weyl and this special case really implies that the numbers jen |ne Z} or 
equivalently that the numbers an(mod1),n € Z, are dense in the circle R/Z when 
a is irrational. This is a result of Kronecker from 1884. Here we wish to discuss 
uniform distribution and the Weyl criterion which implies it, especially in the two- 
dimensional case known as the densely wound line in the torus. The result comes 
from a paper Weyl wrote in 1914. See his collected works [730, Vol. I, pp. 563-599]. 
See also A. Granville and Z. Rudnick [229], H. Iwaniec and E. Kowalski [320, 
Chap. 21], T. W. Korner [372, pp. 11-13], or S. J. Miller and R. Takloo-Bighash 
[467]. In Chap. 3, we will consider non-Euclidean analogues of these results. 


Definition 1.5.1. A sequence {x,} in R” /Z™” is uniformly equidistributed (with 
respect to Lebesgue measure on R” /Z”) iff 


1 n 
lim = X fa) = J f(x)dx, for all continuous f :R”/Z” —C. (1.38) 
k=1 
R” /Z" 


Theorem 1.5.1 (Weyl’s Criterion). A sequence {xn} in R”/Z” is uniformly 
equidistributed iff for all b € Z” — 0, we have 


Het Y eri b =. (1.39) 


n>n k 


To sketch the proof in the <— direction (which can be found for m = 1 in Körner 
(372, pp. 11—-12]), we just note that formula (1.39) is the case of formula (1.38) with 
f(x) = e” ‘bx. To use this case to prove formula (1.38) for any continuous function, 
just uniformly approximate the continuous function by trigonometric polynomials 
in several variables (as we know we can do this via Fourier series). 

The following Exercise is just a rewrite of Exercise 1.3.3 of Sect. 1.3. 
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Exercise 1.5.8. Suppose that a is irrational and 0 < c < d < 1. Show that, if the 
floor of x = |x| = the greatest integer <= x 


1 
nit aj—|aj| € [c,d] | j=1,...,n} > d—c, asn>o, 


Then show that this implies Kronecker’s theorem which says that the numbers 
aj(mod1), j € Z, are indeed dense in R/Z when a is irrational. 


Hint. Find continuous functions on [0,1] uniformly approximating the indicator 
function Xic a(x) = 1 for x € [c,d] and Xje.q\(x) = 0 otherwise. See Körner [372, 
pp. 12-13]. 


The effect of Weyl’s criterion is to change an analysis problem to a favorite 
problem in number theory involving bounds on exponential sums which can often 
be obtained using results in algebraic geometry such as the Riemann hypothesis for 
zeta functions of curves over finite fields. It follows from the Weyl criterion that 
an infinite sequence of points of the form (j/n,(aj/n)+b), j = —d,...,d, ona 
line of irrational slope a in the plane induces a sequence of points which become 
equidistributed on the torus R*/Z?, as d —> œ Thus the line is called “a densely 
wound line in the torus.” 


Example 1.5.1 (Densely Wound Line in the Torus). Figures 1.23-1.25 give some 
idea of what is happening. Of course you cannot really see the difference between 
a line of rational slope and a line of irrational slope. Later we will look at similar 
pictures for the hyperbolic plane. In the figures, the torus is represented by a square 
where you must mentally identify the pair of horizontal sides as well as the pair of 
vertical sides obtaining first a cylinder and then a doughnut. Thus you really obtain 
a connected line on the torus or doughnut. 

We wrote a Mathematica program to do our torus line pictures. It involves the 
function: 


green|x_] := green|x] = 
If[(x — Floor[x]) <= .5,x — Floor|[x], x — Floor[x] — 1] 


which finds an integer translate of x € R which is in (—1/2, 1/2]. We also need the 
function 


br{a_,n_,d_] := br[a,n,d] = 

ListPlot[Table|[{green|j/n], green|ax j/n|},{j,—d,d}}, 
AspectRatio— > 1,PlotRange— > {{—1/2,1/2},{—1/2,1/2}}, 
Background— > LightBlue]. 


Figure 1.23 is a torus line coming from the line with slope 3 produced by the 
command br [3,500,500]. Figure 1.24 is a torus line coming from the line with 
slope 21/34 produced by the command br [21/34,1000,80000]. 
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Fig. 1.24 The figure shows points on a line with slope 21/34 
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Fig. 1.25 The figure shows some points on a line with slope e/2, where we have changed our 
command to get a pink background 


When we used the same command br [E/2,1000,5000] (except for a change 
in background color) for the torus line with slope E/2, we got Fig. 1.25. 

If we were to increase the last argument of the last variable in br [E/2,n,d] 
(which gives the number of points of spacing 1 /n, on the line through the origin of 
slope a) to the same value we used for Fig. 1.24, we can make the square totally 
dark blue in Fig. 1.25. Presumably you do not need to see that picture. However I 
can certainly take a rational approximation to e/2 and produce the same result. It is 
not really possible to see the difference between e/2 and a rational number. In fact 
br [2.718/2,1000,80000] produced a dark blue square as well. 


Before leaving this subject, because we are thinking about Kronecker as well as 
things that are ergodic, fractal, and chaotic (words we have not and may not ever 
define), we cannot resist looking at an example involving a series of sines that is 
due to Weierstrass. Kronecker did not approve of this example. In this example, we 
examine a function that is as non-smooth as one can imagine. In fact, people did 
not believe that such a function could exist before Weierstrass. Hermite described 
these functions as a “dreadful plague.” Poincaré wrote: “Yesterday, if a new function 
was invented it was to serve some practical end; today they are specially invented 
only to show up the arguments of our fathers, and they will never have any other 
use.” Even as late as the 1960s, before “everyone” had a computer fast enough to 
graph approximations to these things, such examples were viewed as pathological 
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3000 
Fig. 1.26 Mathematica plot of the sum > 1,.509-2)k sin (1.5*x) 
k=0 


monsters. Now there are thousands of websites with pictures of approximations 
of them. 


Example 1.5.2. (Weierstrass Continuous Nowhere Differentiable Function) The 
Weierstrass function is defined by an infinite series depending on two parameters 
À and s: 


oo 


fx) = by 262k sin (a‘x) , for A>1land 1<s<2. 
k=0 


That f(x) is continuous is an easy consequence of the Weierstrass M-Test for 
uniform convergence of series of functions. However it can be shown that, if A is 
large enough , the derivative f'(x) does not exist at any point x. See Falconer [172, 
Chap. 11], where it is proved that if À is large enough the graph of this function is a 
fractal with box dimension s as defined below. 


Definition 1.5.2. Suppose S is a subset of R”. For any € > 0, let N (€) be the 
smallest number of m-hypercubes in R” needed to cover S. The box dimension 
of S is then defined to be the following limit if it exists: 


logN 
dimgS = tim PENCE) 
€30+ log = 
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The box dimension is often called capacity. It sometimes differs from the 
Hausdorff dimension which we do not define here. But both definitions of dimension 
agree on most of the simple examples. One idea of a fractal is that it is a subset of 
R” with non-integer Hausdorff dimension. Mandelbrot coined the term “fractal” in 
1977 though many examples were already known at that time. He wanted to obtain 
a method to study phenomena that were not the usual ones we study in mathematics 
and said in his 1983 book on the subject “Clouds are not spheres, mountains are not 
cones, coastlines are not circles, and bark is not smooth, nor does lightning travel 
in a straight line.” Now fractals are of great use in computer graphics. One of the 
most beautiful fractals is the Mandelbrot set. Sadly the Weierstrass function does not 
seem so beautiful but it does resemble many phenomena of the modern world such 
as the rise and fall of the stock market. See Fig. 1.26 for a graph of the Weierstrass 
function of Example 1.5.2. 


Chapter 2 
A Compact Symmetric Space: The Sphere 


It is important that you establish a steady, rhythmic tempo while doing the exercises and 
that you rest as little as possible between them. Also remember not to strain yourself. 


—From Jane Fonda’s Year of Fitness and Health/1984, Desk Diary, Simon and Schuster, 
NY, 1983. Reprinted by permission. 


2.1 Fourier Analysis on the Sphere 


2.1.1 The Sphere as a Symmetric Space 


Whenever there is a large earthquake the Earth vibrates for days afterwards. The vibrations 
consist of the superposition of the elastic—gravitational normal modes of the Earth that are 
excited by the earthquake. 

—From F. Gilbert [212, p. 107]. 


A (surface or Laplace) spherical harmonic is an eigenfunction of the Laplacian 
on the sphere. These are the analogues of exponentials for Fourier analysis on 
the sphere. Laplace and Legendre introduced these functions in order to study 
gravitational theory in the 1780s. Spherical harmonics are necessary for the analysis 
of any phenomena with spherical symmetry; e.g., earthquakes, the hydrogen atom, 
and the solar corona. Some of these topics will be discussed later in this section. 

Since our treatment of harmonic analysis on the sphere is rather condensed, the 
reader may want to consult some of the following references for more information: 
Coifman and Weiss [98], Courant and Hilbert [111], Dym and McKean [147], 
Erdélyi et al. [164], Lebedev [401], Miiller [481], Sugiura [648], Talman [655], 
Wawrzynezyk [722], and Vilenkin [704]. For the history of the subject, see 
Wangerin [716]. 

Before discussing spherical harmonics, we need to understand something about 
the geometry of the sphere. This symmetric space is closely related to the orthog- 
onal group O(n) of real n x n matrices U such that ‘UU = I, where ʻU denotes 
the transpose of U and J denotes the n x n identity matrix. The special orthogonal 
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Fig. 2.1 Angular coordinates 
on the unit sphere 


group, SO(n), is the subgroup of matrices U in O(n) such that the determinant of 
U is one. You can regard U in SO(n — 1) as an element of SO(n) by forming 


« J inSO(n). 


The group O(n) is a compact group and the sphere is a compact symmetric space, 
making some of the analysis on it somewhat easier. 


Exercise 2.1.1 (The Sphere as a Quotient or Homogeneous Space). Consider 
the sphere S”! = {x € R” | ||x|| = 1}. Show that S”~! can be identified with the 
quotient space SO(n) /SO(n — 1), using the preceding identification of SO(n — 1) as 
a subgroup of SO(n). 


Hint. Map the coset gSO(n — 1) to the vector gen for g in SO(n—1) and en = 
(0,...,0,1). 


You may also want to read the discussion of the topology of spheres and 
orthogonal groups in Chevalley [85, pp. 52—67]. In particular, the fundamental (or 
Poincaré) group of SO(2) is isomorphic to Z, while that of SO(n), n > 3, has order 2. 

From now on we shall consider only the sphere S*. See the references for the 
general case. Now the sphere $Ê is a differentiable manifold. This means that 
locally it looks like two-dimensional Euclidean space. To make this precise, we use 
the usual angular coordinates (6,9), 0< @ < 2a, O < 0 < r, pictured in Fig. 2.1, 
to parameterize S? except for the semi-circle C through the poles and (1,0,0). 
A similar coordinate patch can be constructed to cover the rest of the sphere. The 
equations for the rectangular coordinates (x,y,z) of a point on S* in terms of the 
angular coordinates are 


x =sinOcos@ 
y= sin sing (2.1) 
z= cos 
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More information on differentiable manif olds can be found in Yvonne Choquet- 
Bruhat et al. [87], Helgason [276-278], Loomis and Sternberg [428], Singer and 
Thorpe [604], or Spivak [614], for example. 

Before proceeding further with our discussion of spherical geometry, let us 
review how to change variables in the Laplacian, using the method of Courant 
and Hilbert [111, pp. 224-225]. Assume that the substitution mapping (x,y,z) to 
(u,,U2,U3) is differentiable with differentiable inverse. Let the Jacobian matrix of 
the change of variables be 


a (x,y,z) 


A= : 
d (u1,u2,u3) 


(2.2) 


Then the volume element is 
dx dy dz = ||A|| du duz duz, ||A|| = absolute value of determinant of A. (2.3) 


The Euclidean arc length element is 


dx dui 
ds? = (dx dy dz) | dy | = (du, duz du3) "AA duz 
dz duz 
Thus we obtain 
3 
ds? = X, gijdu;idu; where G= 'AA = (gij)i<i j<3- (2.4) 


ij=1 


Similarly, one uses the fact that 


€ of af) _ (3 of Aan 


Ox’ dy’ dz Ou, Our’ duz 
to see that 
of of $ of N 2 yof of =I A 
o a Meee eas. 29 


To see what happens to the Laplacian in the new coordinate system, one can use 
the calculus of variations (see Courant and Hilbert [111, Chap. 4]). Suppose that f 
minimizes the integral 


ICP) = [RARI Mie 


subject to the constraint 
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K(f) = hae dx dy dz = constant. 
Then f must satisfy the Euler-Lagrange equation 


(see Courant and Hilbert [111, p. 192]). That is, f must be an eigenfunction of the 
Euclidean Laplacian. Suppose now that we change variables in the integrals J and 
K. The new constrained minimization problem should lead to a differential equation 
involving the transformed version of the Laplacian. And this is indeed the case. 
From (2.3) and (2.5), one obtains 


3 
=f > 8 fuu; v |G| du; du duz, with |G| = det G, 


i j=l 


and 


K(f) = | P/O] du din dus. 


The Euler-Lagrange equation for the transformed problem is 
3 ôf 
=—y|G I =À G|. 
Dga VEe 5, = AF VIG 
Thus the Laplacian in the new coordinate system is 


Af =fatfy+fe=l6l4 > lott Ș gt 2E (2.6) 
ce ee £ dui E Ou, i 


k=1 


There are other ways to obtain these formulas. See, for example, the following 
references: Arfken [13, Chap. 2], Yvonne Choquet-Bruhat et al. [87, p. 307], 
Churchill and Brown [89, pp. 16-19], or Helgason [277, pp. 386-387]). 

Now we can show that the sphere is a Riemannian manifold, meaning that there 
is a notion of arc length defined from an inner product on the tangent space to the 
surface at each point. In order to obtain the arc length element on S?, we shall use 
the preceding discussion to write the Euclidean arc length element in spherical polar 
coordinates. This is done in the next exercise. 


Exercise 2.1.2 (Spherical Polar Coordinates). Spherical polar coordinates for a 
point (x,y,z) in R? are defined by 


x=rsin@cos@, y=rsinOsing, z=rcos8, 
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where 0 < 9 < 2m, 0< 0 < m, 0< r. Use formulas (2.3), (2.4), and (2.6) above 
to transform the Euclidean volume, arc length, and Laplacian to spherical polar 
coordinates. 


Answer. 
du = dx dy dz=r*sin@ dr d0 dọ, 
ds? = dx? + dy? +d? = dr? + r° d0? + r° sin? 0 dq’, 
—_1 (072 pa), 2 (ang of Ə (_1 of 
Af= rt sin (2er sin 0 gr) + 00 (sino 54) T dp (sh a£). 


It follows from Exercise 2.1.2 that the element of arc length on the unit sphere 
S? is 


ds? = d0? + sin? 0 do’. (2.7) 


This element of arc length is invariant under O(3) as is the corresponding volume 
or area element: 


du =sin@ dé dg. (2.8) 


Finally the Laplacian on S? is 


1 (a(,,a\, 1 & 
“= sind (56 (sno55) " sin a) i 


Exercise 2.1.3. Show that the Euclidean Laplacian Af = fsx + fyy + fzz is invariant 
under O(3) (i.e., that A commutes with rotation). Deduce that the spherical 
Laplacian (2.9) is also invariant under O(3). 


Hint. Dym and McKean [147, p. 243], have a proof using the Fourier transform. 


Because the sphere is a Riemannian manifold, we can consider geodesics on the 
sphere. The geodesic through two points on the sphere is the curve through these 
two points that minimizes distance. Airplane pilots know that geodesics in S$? are 
great circles; i.e., the intersection of $? with a plane through the origin and the two 
given points on S*. To see this, suppose that you are given points p and q on S*. You 
can rotate S? so that both points p and q have -coordinate in (2.1) equal to zero. 
See Exercise 2.1.11 if you do not believe this. Then the distance between p and q 


on S? is 
q a; q 
I \/d02+sin oa> | d0 = 0 (4) — 0 (p). 
p p 


Thus the great circle route minimizes distance on the sphere (assuming that you go 
in the proper direction around the circle). 
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Exercise 2.1.4 (The Sphere Is a Symmetric Space). Show that $? is a symmetric 
space. To do this, it only remains to show that for each point p in S* there is a 
diffeomorphism fp : S? — SÊ that preserves arc length (i.e., fp is an isometry of the 
Riemannian manifold) and reverses geodesics. 


Hint. At p = (0,0,1), use (0,9) > (—0,-@). 


Spherical geometry is non-Euclidean because the geodesics cannot be extended 
indefinitely, which violates Euclid’s second postulate. In fact, Euclid’s fifth postulate 
fails as well. And Klein noticed that if you identify antipodal points on the sphere, 
then any two geodesic lines have a unique point in common. The geometry obtained 
by identifying antipodal points on the sphere is called elliptic geometry and the 
resulting compact surface is also called the projective plane. The term elliptic is 
due to Klein (from a Greek work elleipein, to fall short). It is motivated by the 
following picture. Suppose that two geodesic rays, R; and Rj, emanate from the 
ends of a geodesic segment perpendicular to them both. The distance between the 
rays Rı and R; will decrease or fall short. 

In elliptic or spherical geometry, when A, B, C are the three angles of a geodesic 
triangle, then the area of the triangle is A+B +C — m, measuring the angles in 
radians, of course. This result goes back to Albert Girard (1595-1632) for the 
sphere. References for non-Euclidean geometry are Coxeter [113] and Hilbert and 
Cohn-Vossen [297]. 


Exercise 2.1.5. Prove Girard’s formula for the area of a spherical triangle, which 
was mentioned above. 


Note. This could be proved using the Gauss—Bonnet formula (see Singer and 
Thorpe [604]) and, in fact, it was the first known case of the Gauss—Bonnet theorem. 


2.1.2 Spherical Harmonics 


It is now possible to make sense of the definition of a surface spherical harmonic, 
which was given at the beginning of this section. A surface spherical harmonic Y 
is a function Y : S$? — C such that 


x 1 f 1 
AY =- (cno Yo)o + sate) =AY (2.10) 


for some eigenvalue A € C. The theorem that follows characterizes these spherical 
harmonics very explicitly in terms of (associated) Legendre polynomials. Legendre 
had discussed these polynomials before Laplace’s work, but Legendre did not 
manage to publish his results until after Laplace did (see Legendre [404, 405] and 
Laplace [396]). Spherical harmonics also played a role in Laplace’s long treatise on 
celestial mechanics, as well as in Gauss’s work on terrestrial magnetism. 
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The following exercises give some of the properties of Legendre polynomials 
P(x) defined by 


d” 
2”n!P,(x) = Tan 


(ASI), forn=0,1,2, ..., (2.11) 


and associated Legendre functions P” (x) defined by 


Prg) = (1-2 ZR, (2.12) 


á dx 
Note that some authors replace (1 — x?) by (x? — 1) in formula (2.12). We follow 


Courant and Hilbert [111] and Arfken [13] rather than Lebedev [401] in our choice 
of notation. 


Exercise 2.1.6 (Legendre Polynomials and Associated Legendre Functions). 
(a) Show that the associated Legendre functions are solutions of the Sturm—Liouville 
equation: 


(1—x2)u" — 2xu! + n(n+1)— me u=0 
1-x2} 


(b) If n =0,1,2,..., show that P”(x) = 0 for all x, unless m = 0,+1,+2,...,-n. 
Note that we can allow m to be negative by writing (d/dx)~'(d/dx) = identity. 
Show also that 


P(x), 


(c) Show that for fixed m in {0,+1,+2,...}, the set 


{ Pr" | n= |m], |m| +1, |m|+2,...} 


is a complete orthonormal set for L? |—1, +1]. 


Hint. The orthogonality is easy. For completeness when m = 0, see Courant and 
Hilbert [111, pp. 82-83]. In the general case, write P% (x) = (1 — x7)"/? fm (x). 
Then {f}, k = 0,1,2,...} is a complete set of orthogonal polynomials for 
L?({[-1,+1], (1 —x*)”). This is the weighted L?-space with inner product given by 


E= f fs)e@)(I-27)" dx 


This means that you must show that fọ is orthogonal to x" when r =0,1, ..., k—1. 
Integration by parts, using the case m = 0, will do the trick. 
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Exercise 2.1.7 (Integral Formula for Associated Legendre Functions). Show 
that the associated Legendre P” is represented by the following integral: 


mI(n+m+1) 


T 
m = 2 n 
P(x) =i n(n 1) [ (x+ x? — l cos o) cos(m@p) dọ 


for Re x > 0, m = 0,1,2, .... 
Hint. See Courant and Hilbert [111, p. 505]. 


Exercise 2.1.7 has been generalized to symmetric spaces by Harish-Chandra. 
This will be discussed in later chapters. 


Theorem 2.1.1 (Spherical Harmonics). 


(a) The only eigenvalues à of the spherical Laplacian A* defined in (2.10) 
are à = —n(n+1), n = 0,1,2,.... The vector space of eigenfunctions of 
A* corresponding to the eigenvalue à = —n(n + 1) has dimension 2n + 1. 
A complete orthogonal set of eigenfunctions of A* is 


Y(0,9) =exp(im@)P,"(cos@) for n =0,1,2,..., |m|<n. (2.13) 


Here P” is the associated Legendre function (2.12) and the coordinates (0,0) 
are defined by (2.1). We call cY, with Y given by formula (2.13), a (surface) 
spherical harmonic of degree n and order m, where c is a normalizing 
constant. 

Let f(x,y,z) = r"Y(0@,@), using spherical polar coordinates (r,9,Q) corre- 
sponding to the rectangular coordinates (x,y,z) in Exercise 2.1.2. Then Y is 
a surface spherical harmonic satisfying 


(b 


SS 


A*Y = —n(n+1)Y 


if and only f(x,y,z) is a homogeneous harmonic polynomial of degree n. When 
we say that f is harmonic, we mean that f satisfies Laplace’s equation 


Exercise 2.1.8. Prove part (a) of Theorem 2.1.1, using separation of variables on 
A*Y = AY and Exercise 2.1.6. 


Note. There are other proofs that {A = —n(n+ 1), n= 0,1,2, ...} are the only 
possible eigenvalues of A* on S?. For example, see Dym and McKean [147, pp. 
252-253], Kirillov [353, pp. 271-274], or Van der Waerden [691, p. 21]. 


Exercise 2.1.9. Use separation of variables on Af = 0 to prove part (b) of 
Theorem 2.1.1. Note that Exercise 2.1.2 says that if f(x,y,z) = R(r)Y(@,@), then 
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Fig. 2.2 Zero set of 
P, (cos 0) 


1 2 pl 1 R * t d 
Af = ar R (r)) Y + aA Y — where = 


If Af = 0, then A*Y = AY and (7°R’(r))' = —AR, for some constant A. 


You should not be surprised that the eigenvalues of the Laplacian are real since 
the operator is self-adjoint. The next exercise explains why they are negative. 


Exercise 2.1.10 (Why the Eigenvalues of the Laplacian on the Sphere Are 
Negative). Suppose that f and A* f are in L? (S7). Show that 


(7, Af) <0. 


Use Green’s theorem. Then show that the eigenvalues of A* are all negative or zero. 


According to part (a) of Theorem 2.1.1, separation of variables in A*Y = AY 
leads from functions Y(@,@) of the two angle variables in (2.1) to functions 
exp(im@)P," (cos @). If we assume that Y is constant with respect to the angle 9, 
then Y = Y (0) = P,(cos@). Such a spherical function Y = Y(@) is called a zonal 
spherical function. The name results from the fact that the zeros of Y cut the sphere 
up into zones, as in Fig. 2.2. In general, P,(cos 0) has n distinct zeros in 0 < 0 < m 
which are positioned symmetrically about 0 = 2/2 (see Fig. 2.2). There is a more 
group theoretical definition of zonal spherical function, which is developed in the 
following exercise. 


Exercise 2.1.11 (Zonal Spherical Functions on K\G/K, where G = SO(3), K = 
SO(2)). Let G = SO(3) and K = SO(2) be considered as a subgroup of G as in 
Exercise 2.1.1. We know from Exercise 2.1.2 that the sphere can be identified with 
G/K. Show that the space K\G/K of double cosets KgK, g € G, can be represented 
by the cosets 


cos@ 0 sin@ 
KgoK, with gg= 0 1 0 >, O< O<z. 
—sinð 0 cos@ 
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Table 2.1 Surface spherical harmonics 


seals p) = (—1)m/ en Pan (cos @) exp(img) 


a = Jm 
gp Vè sin@ exp (+i) 
f l= P cos 0 
yx? = TET sin? 0 exp(+2i@) 
d p =F sa Sino cos 0 exp(+iọ) 


y= = a 0 exp(+3iọ) 

y? = J 8 sin? 0 cos @ exp(+2iọ) 

y! =F V& sino Pa 0 —1)exp(+iọp) 
n- [ih Goa — jou) 


When do two of these cosets coincide? The corresponding points to gK on the sphere 
are ge3, where e3 = ‘(0,0,1). So if 


cos@ sing 0 
kg = | —sing cos@ 0 |, 
0 0 1 


we find that k_pgge3 = ‘(cos @sin @,sin @ sin @,cos @). Thus a function on K\G/K 
is a function only of the angle 0. 


Let A = {g9,|0 < O < x}, with gg from Exercise 2.1.11. Then the decomposition 
SO(3) = KAK, which follows from the preceding exercise, is the Euler angle 
decomposition of SO(3). See Volume II [667] for a generalization of this decom- 
position and its application to harmonic analysis on general symmetric spaces. 


Exercise 2.1.12. Check Table 2.1 which lists the first few surface spherical har- 
monics. The table uses the Condon-Shortley [101] convention. 


We can use Mathematica to visualize spherical harmonics using density plots on 
the sphere. See Fig. 2.3 for two such pictures. On the left is that of Re Y (0 ,Q) 
obtained using the Mathematica command: 


ParametricPlot3D[{Cos|@]*Sin[6], Sin|@]*Sin[6],Cos[0]},{g,0,27},{6,0,z}, 
PlotPoints ->300,Mesh->None, ColorFunction->Function [{x,y,z, 0,0}, 
Hue [Re [SphericalHarmonicy [14,7,90,@]]]],ColorFunctionScaling-> 
False]. 


On the right is that of ReY,/,(@,@)+2Re¥3(0,@) obtained using a similar 
command. 
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Lo 


Fig. 2.3 Mathematica density plot for Y;/,(@, @) on the left and for ReY,/,(@, p)+2Re¥;(@, @) 
on the right 


These should be compared with analogous plots for the analogous functions on 
a unit square in the plane in Fig. 1.4, which we considered in Sect. 1.3. 


Exercise 2.1.13. (a) Make some similar plots to those in Fig. 2.3 using the Math- 
ematica command 


Re[SphericalHarmonicY [m,n, 0, @]]. 


(b) Solve the wave equation on the sphere with initial condition modelling what 
would happen if the sphere were struck with a hammer at a point. Then make 
a Mathematica movie of your solution. 

(c) Consider the effect of forcing a sphere to vibrate at some frequency near an 
eigenfrequency of A. 


Corollary 2.1.1 (Harmonic Analysis on the Sphere). Every function f : S? > C 
with continuous second-order derivatives can be expanded in an absolutely and 
uniformly convergent series of spherical harmonics. If we take Y;", |m| < n, to be 
an orthonormal basis for L? (S$? ,du), with du as in Exercise 2.1.2, then 


f(9,) = by = f(n,m)¥,"(8,9), 


n>0|m|<n 


where 
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Exercise 2.1.14. Prove the preceding corollary of Theorem 2.1.1. 


Hint. The corollary can be proved from the existence of the Green’s function for 
the problem (see Courant and Hilbert [111, p. 369]) or you might try to relate this 
problem to Theorem 1.3.1 of Sect. 1.3. 


Exercise 2.1.15 (The Gibbs Phenomenon for the Sphere). Find an analogue of 
Exercises 1.3.1 and 1.3.2 of Sect. 1.3 for the sphere. 


Hint. There is a discussion of the Gibbs phenomenon as well as summability 
procedures in Weyl [730, Vol. I, pp. 305-353, 376-389 and Vol. IV, pp. 432-456]. 


There are many facts about Fourier series that would be worthwhile to extend 
to Laplace series of spherical harmonics. For example, it is possible to obtain the 
analogue of the results of Slepian and Pollak (see Griinbaum et al. [234]). And it 
is possible to obtain a central limit theorem for the sphere or even SO(3) (see Clerc 
and Roynette [92]). 

Expansions in spherical harmonics are useful in many areas. For example, 
Backus and Gilbert [19], among others, have used such expansions relative to the 
vibration of the earth due to large earthquakes to study the structure of the interior of 
the earth. Some of the flavor of their work can be derived from the following quote. 


The Chilean earthquake of May 22, 1960 excited oscillations of the earth with periods 
from six to one hundred minutes which were observed continuously on strain gauges for 
265 hours after the main shock and at one-minute intervals on a gravimeter for 110 hr. In 
the Fourier spectra of these records, peaks were observed which corresponded well with 
the characteristic seismic oscillations computed by Pekeris and Gilbert and MacDonald for 
certain earth models. Some of the observed peaks, however, differed in period from the 
theoretical spectral lines, and some of the observed peaks appeared to be double or triple, 
the components being separated by as much as two minutes and the theoretical line falling 
approximately at their mean position. 

The displacements of observed peaks relative to theoretical frequencies presumably 
reflect slight errors in the earth model used in the theory and will not be discussed further 
here. The line splitting cannot be explained thus; we propose to explain it quantitatively as 
an effect of the diurnal rotation of the earth. 


2.1.3 Quantum Mechanics: The Hydrogen Atom 


According to the most simplistic version of quantum mechanics (i.e., neglecting 
spin and relativistic effects), the wave function y for the hydrogen atom satisfies 
the Schrédinger equation 


h e? 
( A+ )w=ev. r=(?+y42)?. (2.14) 
2m r 


Here —e is the charge of the electron, A = Planck’s constant divided by 27, A is 
the Laplacian (Af = fix + fyy + fz), m= mem,/(me+mp) is the reduced mass 
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of the system if m, is the mass of the electron and m, the mass of the proton. The 
eigenvalue E is the energy level of the system. Some references for these matters 
are Biedenharn and Louck [39], Castellan [80], Condon and Odabasi [100], Courant 
and Hilbert [111, pp. 341-343], Eyring, Walter, and Kimball [170], Mackey [442, 
pp. 159-271], Messiah [465, especially pp. 362 and 412], Sommerfeld [610, pp. 
200-206], B. Thaller [676], Van der Waerden [691, Chap. 1], Dorothy Wallace and 
J. BelBruno [713], Weyl [731, pp. 41-70], and Wigner [738]. 

The eigenvalues F that lie in the discrete spectrum of the Schrödinger operator in 
equation (2.14) will be the only ones of interest for our discussion. Such eigenvalues 
are negative (in fact, there is also a continuous spectrum of positive real numbers). 
Physicists interpret the values of E that lie in the discrete spectrum as energy levels 
of bound states of hydrogen. Spectroscopists going back to Balmer in the 1880s 
have found various series of lines in the spectrum of hydrogen corresponding to 
these energy levels. We can obtain some understanding of these spectral lines by 
finding the discrete spectrum of the Schrodinger operator very explicitly. 

We shall use spherical harmonics to separate variables via w(x,y,z) = 
w(r)Y(0,@) in equation (2.14) for the purpose of obtaining the discrete spectrum 
eigenvalues E£. Then Exercise 2.1.2 shows that (2.14) becomes 


(wY + (2mr*/h)(E+e?/r)w  A*Y 
w Y’ 


where ‘= L 
dr 
Both sides of this equation must be constant. Thus, by Theorem 2.1.1, 


A*Y =—n(n+1)Y, n=0,1,2,..., 


2 2 2 
(rw)! + > G + =) w=n(n+1)w. 


Using Exercise 2.1.16 on Laguerre polynomials we find that 
w(r) = r"exp(—cr/(2K)) LE) (cr/k), k€ Z,k >n, c =2me?/(h?). 
Furthermore, the corresponding eigenvalues are 


met 


2RR” 


E= (2.15) 

Physicists call k the principal quantum number, n the azimuthal quantum 
number. If Y = Y}, |p| < n, from Theorem 2.1.1, then p is called the magnetic 
quantum number. For each k, there are 


k—1 
K= > (2n+1) (2.16) 
n=0 
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linearly independent eigenfunctions of (2.14) with eigenvalue Ez. Physicists call this 
k’-fold degeneracy. For example, spectroscopists use the letters (s, p,d, f,q,...) to 
indicate the value of n corresponding to the state of hydrogen. They would say the 
ground state corresponds to k = 1 and is a 1s state. The first excited state corresponds 
to k = 2. It is said to be fourfold degenerate because it contains one 2s state and three 
2p states. 

Thus for each value k, n of the principal and azimuthal quantum numbers there 
are (2n + 1) experimentally indistinguishable states of the hydrogen atom. This 
mathematically explains the Zeeman effect in which spectral lines of hydrogen 
split into an odd number of lines when a magnetic field is switched on. The effect 
was first observed by Zeeman in 1896. 

Note that E£; given by formula (2.15) must equal hv, where v is the frequency of 
a spectral line. And we obtain 


v=Rk”, 


where R is the Rydberg constant, R = 22?me*h->. When an atom moves from initial 
state 1 with principal quantum number k; to final state 2 with principal quantum 
number kz, the frequency of the associated spectral line is 


v=R(k; — k’). 


The series of spectral lines of hydrogen observed by Balmer in 1885 has kz = 2. 
Thus the Balmer series lines have frequencies 


v=R(27?-k’), k=3,4,5,.... 


These lines are in the visible range. Lyman found a series of spectral lines in the 
ultraviolet range in 1909 with ky = 1. The series with ky = 3,4, are in the infrared 
range. The Balmer series is not obtained in the laboratory because it requires the 
temperature of stellar atmospheres. 

The theory just described does not account for the fine structure of the spectrum. 
Relativistic effects and spin must be considered. Even then, agreement between 
theory and experiment is not perfect. See Messiah [465, Vol. II, pp. 930-933], for a 
discussion of the Lamb shift. 

There is an additional degeneracy of (n+ 1)? rather than 2n+ 1, which was 
explained by Fock, who showed that if one formulates the Schrödinger equation 
for the hydrogen atom as an integral equation, then it is also invariant under SO(4). 
See Louck and Metropolis [430] for a discussion of a related problem which was 
motivated by Fock’s result. 

Spectroscopists daily analyze all sorts of materials—atoms, molecules, crystals, 
solids, gases. But, of course, the theory becomes much more complicated for the 
many-body problem. Group theory helps, as many of the references mentioned at 
the beginning of this discussion show. 
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Exercise 2.1.16 (Laguerre Polynomials). Set 


-dP 
Lp = Erpe 2) = the pth Laguerre polynomial. 
Z 


(2n+1) 


Show that w(r) = r” exp(—cr/(2k))Ly_ 


(cr/k) satisfies 


2 2 2 
(wY + 2 G + <) w=n(n+1)u, 


for E given by equation (2.15), k € Z, k >n, c = 2me*/(h?). Then show that we 
have found the only continuous bounded eigenfunctions w(r) for the differential 
operator involved here. 


Hint. See Arfken [13, Chap. 13], or Courant and Hilbert [111, p. 330]. 


We have found a wave function y solving equation (2.14) of the form 
__ ym n (2n+1) 
y(r, 6,0) =Y,"(8,@)r" exp(—cr/2k) Lie,  (cr/k), 


where Y,” denotes a spherical harmonic as in Theorem 2.1.1 and ia is the b® 
derivative of the Laguerre polynomial La. Then |y|* can be interpreted as the 
probability density of the electron for the various states of the hydrogen atom. 
Fig. 2.4 from White [733, p. 71], illustrates the first few states. 


2.1.4 The Sun’s Magnetic Field 


Using the measured magnetic field of the sun’s surface, the problem is to determine 
the magnetic field of the corona, assuming that the latter is current-free. The corona 
itself was not recognized until the last half of the nineteenth century. A new green 
spectral line was found to appear only in the corona and just above the thin layer 
next to the solar surface which emits the bright red Balmer line of hydrogen. By 
1930, eighteen spectral lines had been discovered in the solar corona. And by the 
early 1940s it was realized that the green line comes from forbidden transitions in 
highly ionized metal atoms. The lack of spectral lines of hydrogen and helium in 
the corona implies that these elements are completely ionized. The corona is mostly 
hydrogen plasma at one to two million degrees K. There are also cooler regions and 
warmer regions. Since the corona is too hot to be in thermal equilibrium, there is 
a solar wind. And observed coronal holes are sources of streams of plasma, which 
ultimately give rise to terrestrial phenomena such as aurorae and disturbances in 
radio transmission. A more complete discussion of these basic facts can be found in 
Altschuler’s article in Herman [290, pp. 105-145]. A fascinating account of solar 
theory can be found in Zirin [755]. 
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3s m=0 


Fig. 2.4 Photographs of the electron cloud for various states of the hydrogen atom as made from 
a spinning mechanical model. The probability-density distribution yy* is symmetrical about the 


@-axis, which is vertical and in the plane of paper. (From H.E. White, [733, p. 71]. Reprinted by 
permission of McGraw-Hill) 
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The idealized mathematical problem involved in determining the magnetic field 
of the solar corona using measurements from the solar surface can be posed as 
follows: find y, where 


Ay =0, R < [xl] < Rw, 
w(x) = f(x), if ||x|| = R, the sun’s surface, 
w(x) =0, if ||x|| = Rw, sufficiently far from the sun. 


The function f(x) is obtained from the measured magnetic field of the sun’s surface. 
The relation B = —grad y gives the magnetic field itself. The assumption is that 
there are no magnetic monopoles and thus y must be harmonic. So we can use 
Exercise 2.1.9 and Corollary 2.1.1 to Theorem 2.1.1 to obtain 


R n+1 anti (T\n\ eZ m 
vi, 0,9) = g — (Œ) a (=) F(n,m)¥n"(8, 9), 
|m|<n 


where a = R/R,, < 1 and 
finm) = |, f YF du. 


This formula is satisfactory as a solution in a textbook, but the numerical problem 
remains to be solved. See Altschuler’s article mentioned above for a description of 
the numerical work that led to the illustrations in Fig. 2.5. 

Another interesting application of spherical harmonics can be found in 
Chandrasekhar [83, Chap. VI]. The same sort of picture that gave us an idea of 
the shape of the probability distribution of the electron in a hydrogen atom should 
give an idea of the distribution of heat in a uniform sphere. Several applications 
of spherical harmonics to geophysics can be found in Gilbert [212], Kato [341], 
Stacey [616], and Trefil [683, pp. 106-111]. 

It is possible to generalize Theorem 2.1.1 to the unit sphere S” in R’*!, In 
fact, spherical harmonics for such higher-dimensional spheres have been of use in 
quantum mechanics, as have the representations of many higher-dimensional Lie 
groups. In the Lecture Note-Reprint collection Dyson [148] gives an interesting 
glimpse into the arguments concerning the choice of symmetry groups for nuclear 
and particle physics. The compact groups SU (n), n = 3,6, 12, seem to have attracted 
the most attention. Here SU (n) is the special unitary group of n x n complex 
matrices U such that ‘UU = I and |U| = 1. Some of the papers in the Dyson 
collection are devoted to showing that the representations of SU(6) are somehow 
incompatible with those of the Lorentz group O(3,1) which leaves the Maxwell 
equations invariant. See R. Hermann [291, pp. 144-149] and Talman [655, pp. 
186-187] for discussions of Fock’s work on applications of four-dimensional 
spherical harmonics to quantum mechanics. The standard model in particle physics 
has as its local gauge group SU(3) x SU(2) x SU(1). Representations of the group 
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correspond to the fields. There are many books on the subject. We leave it to you 
to Google them, or see Wikipedia. There is also the Grand Unified Theory, or GUT. 
This involves SU(5),SO(10) and even the exceptional Lie group Eg. Again there is 
much on the Internet including some beautiful movies related to Eg. 

Hecke and others have made use of these higher-dimensional spherical harmon- 
ics in number theory and the kinetic theory of gases (see Hecke [258, pp. 361-373, 
849-854]) and Ogg [505, Chap. VI, p. 6]. We shall see in the next section that 
spherical harmonics also give a new description of harmonic analysis on R”. 


2.1.5 Connections with Group Representations 


It is not difficult to connect spherical harmonics with representations of SO(3). 
Before doing so, let us sketch some of the basic facts about group representations. 
We will be sketchy because we do not intend to emphasize the representation- 
theoretic point of view. It would be useful for the reader to study these things 
however, and the following references will provide plenty of food for thought: Barut 
and Ragzka [28], Boerner [45], Gelfand, Minlos, and Shapiro [205], Gurarie [238], 
Hamermesh [246], Knapp [356], Mackey [442-444], Maurin [458], Sugiura [648], 
Talman [655], Van der Waerden [691], Varadarajan [692], Michelle Vergne [697], 
Vilenkin [704], Wallach [714,715], Warner [717], Wawrzyficzyk [722], Wey] [730- 
732], Wigner [738], and Williams [741]. Mackey’s introduction to Biedenharn and 
Louck [39] provides a translation of physicists’ terminology for mathematicians (or 
vice versa). 

Suppose that H is a separable Hilbert space and let GL(H) be the group of 
invertible continuous linear maps from H to H. We want to consider representations 
of a topological group G by elements of GL(H). By a topological group G we 
mean a group with a topology such that multiplication (x,y) — xy and inversion 
x= x7! are continuous maps. A topological group is locally Euclidean if there is 
a neighborhood of the identity e in G which is homeomorphic to an open subset of 
R”. A Lie group is a locally Euclidean topological group with group operations that 
are infinitely differentiable maps. Most, if not all, of the Lie groups considered here 
are groups of matrices like O(n). 

A representation of a topological group G is a pair (TJ, H) where H is a separable 
Hilbert space and T : G —> GL(H) is a continuous group homomorphism, using the 
strong operator topology on GL(H). Actually a stronger definition is made when G 
is not locally compact (see Kirillov [353, p. 111]). You might wonder why we want 
to represent groups of matrices by their infinite-dimensional analogues. The answer 
is that this cannot be avoided if you wish to do Fourier analysis on a noncompact 
group. 

Let (v,w), v, w € H, denote the Hilbert space inner product on H. We say that 
the representation (T, H) is unitary if (T (g)v, T (g)w) = (v,w) for all v,w in H and 
g in G. The representation (T,H) is said to be finite-dimensional if H is finite- 
dimensional and then dim H = degree of the representation. 
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Suppose that v, w are elements of H. The function g +> (T(g)v,w) = myy(g), 
for g € G, is called a matrix entry of the representation (T,H). Many of the 
special functions which are useful in applied mathematics are either matrix entries 
of representations of well-known Lie groups or are part of an orthogonal basis for 
the Hilbert space of such a representation. 

The theory of group representations is closely connected with the search for 
eigenfunctions of differential operators. For if a group G leaves a differential 
operator L invariant, then G takes eigenfunctions of L into eigenfunctions of L with 
the same eigenvalue. So G acts on the vector space H of eigenfunctions of L for a 
fixed eigenvalue A to give a representation of G. 

We say that a representation (T,H) of G is irreducible if there is no closed 
subspace W of H with {0} SW SH such that T(g)W C W for all g € G. 

Two representations (Tı, Hı) and (T2,H2) of a Lie group G are said to be 
equivalent if there is a linear bicontinuous bijection A : Hı — H2 such that To(g)A = 
AT, (g) for all g in G. Then A is called an intertwining operator. Or even more 
generally, if A is only continuous linear, it is still called an intertwining operator. 

If the group is R” under addition, the irreducible unitary representations are one- 
dimensional and a complete list of inequivalent irreducible unitary representations 
of R” under addition consists of the exponentials 


R™ = { ea(x) = exp (‘ax) ,for x€R” | aeR"} (2.17) 


Exercise 2.1.17 (Schur’s Lemma). 


(a) Suppose that (Tı, Hı) and (T2, H2) are finite-dimensional irreducible represen- 
tations of G that are inequivalent. If A : Hj — H, is a linear map such that 
ATı (2) = To(g)A, for all g in G, then A = 0. Prove this. 

(b) Suppose that (T, H) is an irreducible representation of G on a finite-dimensional 
complex vector space H. If A: H —> H isa linear map such that AT (g) = T(g)A, 
for all g in G, show that there is a complex number c such that A = cl, where I 
is the identity operator; i.e., Iv = v, for all v in H. 


Note. The spectral theorem for unitary operators allows you to extend the result 
of the preceding exercise to unitary representations on infinite-dimensional Hilbert 
spaces. 


We say that a finite-dimensional representation (T,H) of G is completely 
reducible if H = H; ©---®H,, with T(g)H; C Hj, for all g € G, provided that 
the representations (7;,H;) of G obtained by restricting T(g) to H; are irreducible 
forall i= 1, ..., n. We say T = T, @---@T,, = the direct sum of the T;. 


Exercise 2.1.18. Given a basis e1,..., €m of the representation space H of a finite- 
dimensional representation (T, H) of G, you can form the matrix corresponding to 
T(g). If e1, ..., €m is an orthonormal basis of H, then this matrix has (i, j) entry 


(T(g)ei,e;) =mj,i(g), for g € G. 
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(a) Show that two finite-dimensional representations (T, H) and (T2, H) are equiv- 
alent if and only if the matrix of T; is obtained from the matrix of T) by changing 
the basis of H that is used. 

(b) Show that a finite-dimensional representation (T,H) of G is completely 
reducible if and only if you can find a basis of H which puts the matrix of 
T in block diagonal form: 


so that the representation determined by the matrix T; is irreducible for each 
i=1,...,n. 


We will often need to be able to integrate functions on topological groups. The 
integral involved is an analogue of the Lebesgue integral on R” and is supposed to 
come from a countably additive positive measure on the Borel sets in the group. 
Such an integral on a topological group G is right invariant if 


f f(x) dx= f f(xa)dx forall ain G. 
G G 


The invariant integral on a compact Lie group was used by Hurwitz, Schur, and 
Weyl beginning in the 1890s. Haar proved the existence of a invariant integral on 
any locally compact topological group in 1933 and thus the integral is called the 
Haar integral. If G = R”, under addition, then the Haar integral is the standard 
Lebesgue integral. Proofs of the existence of the Haar integral may be found in 
Helgason [277, p. 365], Lang [387], Pontryagin [518], or Weil [725], for example. 

The invariant integral can be used to show that any irreducible unitary represen- 
tation of a compact group must be finite-dimensional (see Kirillov [353, p. 135]), 
for example. And you will need the invariant integral to do Exercise 2.1.20. In fact, 
integration on topological groups will be a necessary tool for the rest of this book. 
Of course, our groups will be matrix groups and thus the Haar integral for these 
groups is not so mysterious. 

The right-invariant Haar measure dx is unique up to a positive constant multiple. 
This means that for each g in G, there is a positive constant 6(g) (not to be confused 
with the like-named distribution) defined by 


J flex) dx = 8le) J f(x) dx. 


Then 6 : G— R* is a continuous homomorphism, which is called the modular 
function of G (not to be confused with the modular functions that appear in 
Chap. 3). Clearly this function relates the right- and left-invariant Haar integrals. 
In particular, one has 
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[#00 6) dx= f FO) ax 


The group G is said to be unimodular when 6(g) = 1 for all g in G. This means 
that right-invariant Haar integrals are also left invariant. Many groups that we shall 
consider are unimodular. In particular, all compact or abelian groups are unimodular. 
Most of the Lie groups that we shall consider in this book are what is called 
“semisimple”; for example, the special linear group of all n x n real matrices of 
determinant one. It can be shown that all semisimple Lie groups are unimodular (see 
Helgason [277, p. 366]). However, not all groups are unimodular. For example, the 
group of upper triangular matrices with positive diagonal entries is not unimodular 
if n > 3 (Exercise). 


Exercise 2.1.19. Show that any finite-dimensional unitary representation is com- 
pletely reducible and that its decomposition into irreducible representations is 
unique up to equivalence. 


Exercise 2.1.20 (The Haar Integral on a Compact Group). 


(a) Show that any compact group is unimodular; i.e., right Haar measure = left 
Haar measure. 

(b) Use Haar measure to show that any representation of a compact group is 
equivalent to a unitary representation. 


Suppose that (Tı, Hı) and (7>,H2) are representations of G. Define the tensor 
product representation (7, $ P, H; ® H2) by 


(T ® Tr)(g)(v1 ® v2) = (T1(g)v1) ® (Th(g)v2) forg €G, vj € Hj, i= 1,2. 


See the references mentioned above for more information on tensor products. The 
decomposition of tensor products of representations into direct sums of irreducible 
representations has much importance for quantum mechanics. For example, if 
one ignores the interaction between two electrons in the field of a positive nucleus, 
the Schrödinger operator of the system has eigenfunctions which are products yı yo 
of eigenfunctions y; corresponding to representations T; of O(3), i= 1,2. And the 
product y y2 corresponds to the representation Ti © T2. The Clebsch—Gordon series 
breaks 7; ® T up into its irreducible components. Thus one can conclude what sort 
of spectral lines should occur for such a situation. See the references mentioned in 
Sect. 2.1.3 on quantum mechanics for more details. 

There are many other useful topics in representation theory such as induced 
representations, Frobenius reciprocity, Cartan’s theorem on the highest weight, 
and Weyl’s character formula. See the references on group representations for a 
discussion of these matters. 

The systematic study of group representations of finite groups began in the 
1890s with work of Frobenius, Schur, and others. In the 1920s, Weyl obtained the 
irreducible representations of the compact simple Lie groups such as G = SO(3) (see 
Wey] [730, Vol. II, pp. 543-647]). To do this, Weyl used his formula for the character 
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of a representation T, which is the trace of T(g),g € G. The representations of 
compact Lie groups G are studied by restricting them to a maximal abelian subgroup 
A or torus in G. Any representation of A decomposes into a direct sum of one- 
dimensional representations called weights. Among the weights there is a highest 
one and this characterizes the original representation of G, up to equivalence. We 
have actually seen an example of the theorem on the highest weight in our study 
of spherical harmonics, but this will not be developed here. Another result called 
the Borel—Weil—Bott theorem realizes the representations of compact Lie groups on 
sheaf cohomology groups (see Warner [717]). 

The character yr of a finite-dimensional representation T is defined by yr(g) = 
Trace(T(g)) for g € G. Define G = the dual of G to be the set of equivalence classes 
of irreducible unitary representations of G. For example, since the complete list of 
irreducible unitary representations is given by formula (2.17), R™ can be identified 
with R”. It is said that R” is self-dual. If G is compact, describing G is equivalent 
to describing the characters of G. 

When infinite-dimensional representations are required, the character is defined 
as a distribution when possible. If f € L! (G), define 


T) = | FETE) ds, 


which means that for x,y € H, we have 


(T(f)x,y) = i. f(g)(T(g)x,y) dg, 


with (,) = the Hilbert space inner product on H. If H has an orthonormal basis ej, 
then we can often define the character of T via 


Tr T(f) = d(T (f)ei,e;) where Tr = Trace, 


considering the map f ++ Tr T(f) as a distribution. When H is finite-dimensional, 


TeT(f)= | (Te T(e))f() ds. 


G 


When G is a “tame” unimodular Lie group the abstract Plancherel theorem (or 
Fourier inversion theorem) says there is a measure du on G called the Plancherel 
measure such that 


fle) = f, Tr T(f) du (T), 


for infinitely differentiable functions f on G with compact support. See Michelle 
Vergne [697] for a interesting discussion of this subject with many examples. 
Harish-Chandra obtained the Plancherel measure for real semisimple Lie groups, 
for which the Plancherel inversion formula does not always involve an integral over 
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all of the dual G. In Chap. 1 we saw the result for R” and R” /Z™. In this chapter 
we have considered the result for functions on SO(3)/SO(2). 

One of the main goals of these volumes is the explicit description of the 
Plancherel measure—not for functions on the group GL(n,R) but instead for 
functions on the symmetric space GL(n,R)/O(n). See Volume II (i.e., [667]) for 
this result and its history. 

Formula (2.18) below gives the Plancherel theorem for the group SO(3). 


Exercise 2.1.21 (The Irreducible Unitary Representations of SO(3)). Let 
{¥;", |m| <n, n=0,1,2, ...} denote a complete orthonormal set of spherical 
harmonics as in Theorem 2.1.1. Define a (2n + 1) x (2n + 1) matrix A,(g) for g in 
SO(3) by 

Yr (gx) = 3 (An)m Yn (x). 


|k|<n 


(a) Justify this formula for A,,(g) and then show that A„ (g) defines a representation 
of SO(3). 
(b) Show that A„ (g) is a unitary representation using 


[enone dx= f YOYE ax. 
S Se 


(c) Show that the representation An (g) is irreducible. 


In fact, the representations A,(g) form a complete set of inequivalent irreducible 
unitary representations of SO(3). This means that any function f in L?(SO(3)) with 
respect to Haar measure has a Fourier series expression, converging in the L? norm 


f(g) = Xpro(2n + 1) Trace [f(n)An(g)], where 
(2.18) 
f(n) = Jsoa) f(g) 'An(g) dg, dg = Haar measure on G. 


This is proved, for example, in Dym and McKean [147, pp. 256-261]. Vilenkin 
[704, pp. 440-457], examines the representations of SO(n) defined in an analogous 
way to that used for n = 3 in Exercise 2.1.21 and shows that if n > 3, then 
these representations do not exhaust all of the irreducible unitary representations 
of SO(n). Only the class-one representations are obtained in this way. A class-one 
representation (A, H) of G has a vector v in H such that A(k)v = v for all k in K. 
Here G = SO(n) and K = SO(n — 1) (embedded in G as in Exercise 2.1.1). 
Formula (2.18) giving Fourier analysis on the group SO(3) is a special case of the 
Peter—Weyl theorem for any compact group (see Weyl [730, Vol. II, pp. 58-75], 
Pontryagin [518], or Weil [725]). The theorem says that if {re = (m%), œ € A} 
is a complete system of irreducible unitary representations of a compact topological 
group G, then vdam® forms a complete orthonormal set in L? (G), where dg = 


ij 
degree of T%. The proof is not hard using the spectral theorem for compact self- 
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adjoint operators (Theorem 1.3.7 of Sect. 1.3). The compact self-adjoint operator 
used in the proof of the Peter-Wey] theorem is the convolution operator T f = g*f, 
where g is a fixed nonzero continuous function on G such that g(x) = g(x!) 
for all x in G. The convolution is taken with respect to Haar measure on G. The 
eigenfunctions for a fixed eigenvalue of T form a finite-dimensional vector space on 
which G operates by sending f(x) to f(xa) for a € G. This gives a representation of 
G. The eigenfunctions for T form a complete orthonormal set in L? (G) and thus so 
do the matrix entries of the unitary irreducible representations of G. It follows that 
the analogue of formula (2.18) replaces A, by the irreducible unitary representations 
of G and (2n + 1) by the degree of the irreducible representation. 

The fact that special functions come from representations often leads to an easier 
understanding of the myriads of formulas listed in books such as Erdélyi et al. [164, 
165]. 


Exercise 2.1.22 (Addition Formulas for Matrix Entries). Suppose that (T, H) is 
a finite-dimensional representation of G and that e1, ... ,eé, is an orthonormal basis 
of H. Let the i, j th matrix entry of T be mjj(g) = (T(g)ei,e;) for g in G. Here (,) 
denotes the inner product on H. Show that T (gh) = T(g)T (A) implies the following 
addition formula for the matrix entries: 


mi;(gh) = È mlh mg (8 


Exercise 2.1.23 (Addition Formula for Spherical Harmonics). For x,y in s?, 
prove that 


aee ntl 
E Koro) = ARC). 


|m|<n 


Note that both sides are unchanged if you replace x by gx and y by gy for g in 
SO(3). So the right and left sides of the equation can only differ by a constant. Find 
the constant by setting x = y and integrating over S?. 


2.1.6 Integral Equations for Spherical Harmonics 


It is possible to characterize spherical harmonics by integral equations rather than 
differential equations. This method often simplifies the theory, for the same reason 
that Green’s functions simplify the theory of differential operators (see Sect. 1.3 
and the discussion surrounding equation (1.16)). Examples of the method appear 
in Weyl’s work on spherical harmonics (see Weyl [730, Vol. II, pp. 386—399]), 
Selberg’s work on the trace formula (see Selberg [569]), and Harish-Chandra’s work 
on group representations (see Harish-Chandra [253,254]). The method of integral 
equations also makes it easier to extend results to finite, p-adic and adelic groups 
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(see Gelbart [200], Godement [213], Macdonald [440], Tamagawa [656]). This 
should not, however, cause us to forget the differential equations and the contact 
with applied mathematics. 

The following theorem was proved in 1916 by Funk [187] and in 1918 by Hecke 
[258, pp. 208-214]. 


Theorem 2.1.2 (The Funk—Hecke Theorem). Suppose that Y, is a spherical 
harmonic of degree n. Let f : [—1,+1] > C be continuous. Then 


i f (‘ux)Y¥,(u) du = 2Y, (x wf f(t)Pa(t) dt 


Proof. Since every continuous function on [—1, +1] can be uniformly approximated 
by zonal spherical functions P,,(x), it suffices to do the following exercises. 


Exercise 2.1.24 (Properties of Spherical Harmonics). 


(a) Show that 


4r 
Yin PC = Yn mn: 
f IOP) dx = =" Yao) 


Here mn = 0 if m Anand Onn = 1 if m =n. 
Hint. Use the addition formula of Exercise 2.1.23. 
(b) Prove that 


The Funk—Hecke theorem actually characterizes spherical functions. This result 
is generalized in Helgason [277, p. 439, Cor. 7.4]. And we shall also use this 
approach in later chapters. In order to generalize the Funk—Hecke theorem, one 
should view the spherical harmonic Y as a function on G = SO(3) and replace the 
formula in Theorem 2.1.2 by 


(1) L Y(xky) dk =Y (x) T Y(ky)dk forall x,y in G, Q19 


where dk is Haar measure on K = SO(2) embedded in G as in Exercise 2.1.1, and 
I = the identity matrix. Furthermore zonal spherical harmonics P are characterized 
by the integral equation 


o f P(xky) dk = P(x)P(y) forall x,y in G. (2.20) 
K 
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This characterization of zonal spherical harmonics is due to Gelfand [201] in the 
general case. 


Exercise 2.1.25 (The Group-Theoretical Version of the Funk—Hecke Theorem). 
Show that formula (2.19) implies the Funk—Hecke theorem by integrating (2.19) 
times f(y) where f : K\G/K > C. 


Exercise 2.1.26 (The Convolution Theorem for SO(3)). Let f,g be integrable 
functions on SO(3) with respect to Haar measure. Define the convolution of f and 


g by 


(fxg) (x) = f(u)g(xu-!) du for x in SO(3). 
so(3) 


Define the Fourier transform f (n) forn =0,1,... as in formula (2.18). Show that 


n 


(fxg)(n) = f(n)8(n) for alln =0,1,2, .... 


Hint. Use the fact that Ay is a representation. 


Exercise 2.1.27 (The Fundamental Solution to the Heat Equation on the Sphere 
S? C R*). Given the initial heat distribution (0), solve the following initial value 
problem: 


A*u(9,9,t) =u, u(0,9,0)=f(8), t>0. 


Here A* is the Laplacian on the sphere given by formula (2.10). 


Answer. u = Gx f, where 


G;(0,) = G;(0) = X, cnPr(cos @) exp[—n(n + 1)t]. 


n>0 


The constants cn are chosen so that G; — ô ast — OF. 


Note. G. Watson [720] considers various analogues of the Gaussian distribution for 
the sphere, in connection with various statistical problems such as that of studying 
the distribution of the unit normal vectors to the planes of all known comets. 
Another reference for group theory and statistics is the book by Diaconis [132] 
which includes an interesting story about testing for uniformity, Fisher, Jeffries, and 
continental drift on pages 170-171. 
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It is interesting to consider the method by which algorithms have been “justified” mathemat- 
ically in this field [computerized tomography]. While this method consists of mathematical 
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reasoning in a certain sense, the reasoning is far from rigorous. Approximations are 
introduced at many steps with only intuition as a guide to the error involved. We do not 
know of a single instance in which a tomographic algorithm has been justified in a truly 
rigorous sense. Thus, in contrast to some other workers in this field, we do not feel that one 
derivation is more rigorous than another, whether it is based on Radon’s inversion formula, 
the Fourier inversion formula or any other foundation. 

—From Shepp and Kruskal [588, p. 421]. 


2.2.1 Harmonic Analysis on R? in Spherical Polar 
Coordinates and Spectral Measures 


It is easy to see (cf. Exercise 2.1.1) that the Fourier transform on R” commutes with 
rotation. This leads to a new formulation of harmonic analysis on R” in terms of 
spherical harmonics. For simplicity, we shall consider only the case n = 3 here. The 
generalization to R”, n arbitrary, can be found in Coifman and Weiss [98] as well 
as in Stein and Weiss [636]. These results are due to Cauchy and Poisson for radial 
functions, and Bochner and Hecke in general (see Bochner [43, p. 235], and the 
article of Stein in J. M. Ash [15, pp. 104—105]). 


Exercise 2.2.1 (The Fourier Transform on R” Commutes with Rotation). If g € 
O(n) and f : R” — C, set (L(g) f)(x) = f(gx) for x € R”. Show that L(g) f =L(g)f, 
when f is a sufficiently nice function, that the Fourier transform of f exists (as 
defined in Sect. 1.2 for various classes of functions). 


Exercise 2.2.2 (J-Bessel Functions). Define the Bessel function of the first kind 
by the power series 


E co (—1)*(z/2)"** 
= Tet Dr(kK+V-+1)’ 


for |arg z| < m. 


(a) Show that y(z) = J, (z) satisfies Bessel ’s equation: 
y" + (I/z)y + (1— (v/2}°)y =0. 


(b) Show that J, (z) is represented by the integral formula: 


Tone: 1/2) [ia rj exp(izt) dt, 


Jv(z) = T 


when Re v > — 4 and |arg z| < 7. 


(c) Show that Jy- 1 (z) +Jy4i(z) = 2Jy(z). 
(d) Prove that 
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[2 „1 (1d \"sinz 
J =(— 1)? ae: =0, 1,25. 
n+. (Z) ( ) ma (==) z n i ni 


These functions J, ı are often called spherical Bessel functions. 


Exercise 2.2.3 (More Properties of Bessel Functions). 


(a) Asymptotics. Show that 


Jy(z) ~T(L+v) "(2/2)", as z— 0; 


Jy(z) ~ y/ Z cos (z— YE — F), as z> o. 


(b) Show that J,(z) = (—1)"J_n(z), when n = 0,1,2, .... Then show that if v is not 
an integer, Jy and J_, are linearly independent. 

(c) Functional Equation. Prove that Jy(—z) = exp(vzi)Jy(z). 

(d) Addition Formula. Set R = (r? +73 — 2rır2 cos @)'/? and prove that 


J, (R) = Zr G) 5 (m+ 3) In 4 (FDJ p4} (72)Pm(COs 8). 


rır2 m=0 


Theorem 2.2.1 (The Bochner-Hecke Formula). Suppose that Y (u), u € S* is a 
surface spherical harmonic of degree n (as in Theorem 2.1.1 of Sect. 2.1) and let 
f(ru) = g(r)¥ (u) for r € RĦ, u € S*, where g:R* >C, is such that 


[ lg(r)|r° dr <0, 
0 


Then the Euclidean Fourier transform as defined in Sect. 1.2 of f (assuming that f 
is nice enough for the transform to exist), is given by 


3/2 
vu) f g(s)s Ing (2mrs) ds. 


Proof. Let dv denote the element of surface area on S*. Then, using Theorem 2.1.2 
of Sect. 2.1, we have 


f(ru) = f : s exp(—2zirs 'uv)g(s)¥ (v)s? dsdv 
sERt Jves? 


= 2ny(u) | g(s)s? is exp(—2zrst)P,(t) dt ds. 


sERt 


The proof of this theorem is completed in Exercise 2.2.4. 


Exercise 2.2.4. Show that P,(x) = sr; 4 (x? — 1)” implies that 


dx” 
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+1 , (—i)" 
f _ &xp(—2mirst)Py(t) dt = "Fd 


Use integration by parts and Exercise 2.2.2. Then complete the proof of 
Theorem 2.2.1. 


(277s). 


How does the formula of Bochner and Hecke fit into the general scheme of 
harmonic analysis on symmetric spaces? Why did the Bessel functions suddenly 
appear out of the blue? To understand this, one must view R? as a symmetric space 
rather than just as an additive group. We explained this at the end of Chap. 1. It 
leads one to think of the full group of isometries of RÌ, namely, the Euclidean 
group M(3,R) of rigid motions of R?. If we consider only the orientation-preserving 
motions G, then G is the semidirect product of the groups T and K, where T consists 
of translations and is isomorphic to R?, and K = SO(3). One can view M(3,R) as 


a group of 4 x 4 matrices of the form e i , A € SO(3),b € R?. The matrix 


(4 4 sends x € R? to Ax+b. Clearly one can identify R? with G/K by mapping 


a coset gK, g € G, to g0 = the result of applying the motion g to the origin 0 in R?. 


Exercise 2.2.5. Do an analogue of the Bochner—Hecke theorem for R? for rotation- 
invariant functions f on RÊ. 


Answer. For nice rotation-invariant functions f on R*, one has 


oo 


f(pu) =2n / f(RJo(2npr)rdr. 


0 


The connection of Theorem 2.2.2 below with the representations of 
the Euclidean group (as well as the generalization to R”) is made in 
Vilenkin [704, Chap. 11] and Wawrzyficzyk [722]. The case of R? is 
discussed in Dym and McKean [147, pp. 263-273]. See also Helgason 
(277, pp. 402-403], and Talman [655, Chap. 12]. 


Exercise 2.2.6 (The Eigenfunctions of the Euclidean Laplacian in Spherical 
Polar Coordinates). Use Exercise 2.1.2 of Sect.2.1 to show that if Af = fxx + 
hythee =Af, and f (ru) = g(r)¥ (u) forr € Rt, u € S?, and Y is a surface spherical 
harmonic of degree n, then 


1 2 
g(r) = Vaan A= (27t) . 
Theorem 2.2.2 (Harmonic Analysis on R? in Spherical Polar Coordinates). Let 


enmt(ru) = 2m," (U) (re) PJ 1 (2mrt) forre Rt, u € S*, where {¥" | n= 


0, 1,2,...,|m| < n} denotes a complete orthonormal set of surface spherical 
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harmonics of degree n (as in Theorem 2.1.1 of Sect.2.1). Then any f in L? (R3) 
has a Fourier expansion (converging in the L?-norm) of the form 


=> yy (n,m, t)enmg (x \t? dt, 


n>0|m|<n°! t>0 


where 


finm) = f F}enmiO) dy 


Proof. It suffices to assume that f(ru) = g(r)¥?"(u) for r € RĦ, u € S*. Then we 
must show that 


g(r) =n) | (Js O7 J, 4 (2st)? dr) == J „+1 (2mtrs)s° ds. (2.21) 


This is done in the following exercise. 


Exercise 2.2.7. (a) Prove formula (2.21) by writing down the inversion formula 
for the ordinary Fourier transform (from Theorem 1.2.1 of Sect. 1.2) in spherical 
polar coordinates. Then use Theorem 2.2.1 to evaluate the inner Fourier 
transform. Finally, Theorem 2.1.2 of Sect. 2.1 and Exercise 2.2.4 should be used 
to complete the proof. 

(b) Show that formula (2.21) is equivalent by change of variables to a special case 
of Hankel’s inversion formula: (assuming v > -1 andr > 0): 


= f ysr) | xI) fle) dx dy. (2.22) 
0 0 


Hankel’s inversion formula (2.22) gives the spectral resolution of the singular 
Sturm—Liouville operator 


2 
ca= I (-ary+ — f) for x € (0,9) (2.23) 


This spectral resolution can be derived from a formula of Stieltjes, Stone, Kodaira, 
and Titchmarsh, which is itself a corollary of the Von Neumann Spectral Theorem 
for unbounded operators. Let us summarize the theory briefly, following Lang [387], 
Reed and Simon [540, Chap. 7], Stakgold [618,619], and some unpublished notes 
of J. Korevaar. More details can be found in these references as well as Dunford and 
Schwartz [145], Gelfand and Vilenkin [207], Levitan and Sargsjan [415], Maurin 
[459], Naimark [490], Titchmarsh [679], Weyl [730, Vol. I, pp. 195-297], and 
Yosida [748]. In particular, Dunford and Schwartz [145, Vol. II, pp. 1333-1392, 
1532-1533], provides an extremely careful discussion of spectral theory for singular 
differential operators, including the Bessel operator (2.23). 
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We want to consider linear, possibly unbounded, operators T : D —> V, where D 
is a dense subspace of a Hilbert space V. Define 


D* = {y € V| there is a z in V with (Tx,y) = (x,z), for all x € D}. 


Define T* = the adjoint of T by (Tx,y) = (x,T*y). We say that T is self-adjoint if 
D = D* and T = T*. The spectrum o (T) of the operator T is defined to be 


o(T)={4 EC | (T — A1)~! does not exist as a bounded linear operator on V }. 


The point or discrete spectrum of T (or the set of eigenvalues of T) consists of 
A € C such that (T — ÀI) is not one-to-one. The continuous spectrum of T is the 
set of A € C such that (T — AJ) is 1 - 1 and D = range (T — AJ) is dense in V but 
(T — AI)! is an unbounded linear operator with domain D. 

The von Neumann Spectral Theorem says that if T is any self-adjoint operator 
(bounded or not), there is a Stieltjes integral representation 


r= fae, T= f Ady, (2.24) 


for some family of projection-valued measures dE, (cf. Reed and Simon [540, 
Chap. 7]). The integrals are over the spectrum of T, which is real, because T 
is self-adjoint. The spectral theorem implies that for polynomials p(A), p(T) = 
J p(A)dE,.. It is possible to define f(T) for continuous functions f on the spectrum 
of T and then this same formula holds upon replacing p by f. 

In order to obtain the formula of Stieltjes, Stone, Kodaira, and Titchmarsh one 
must be able to compute the Green’s function or resolvent kernel G(A;x,y) 
defined by 


(T —AD v(x) = I G(A;x,y)v(y) dy. (2.25) 


For if we have a convergent integral of the form 


fw) = [T A-U "e(a)aa, 


then at any point c of continuity of g(A), we have 


z4 lime0+ (f(c +i€) — f(c —ie)) 
This is Poisson’s integral formula for a half-plane (see Ahlfors [3, p. 171]). It follows 


that the spectral measure for the operator T is given by the formula of Stieltjes, 
Stone, Kodaira, and Titchmarsh: 
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ri =(T—(A+i0)N)'-(T-(A-i0))“, (2.26) 
assuming that Ey is well-behaved. This formula is proved in Dunford and Schwartz 
[145, Vol. II], Lang [387, pp. 412—413], and Reed and Simon [540, p. 237]. Stieltjes 
[639] found the result in 1894. We will usually refer to formula (2.26) as the 
Kodaira—Titchmarsh formula for brevity. 

In order to apply formula (2.26), we need a way to compute the Green’s function 
G(A;x,y). We use the method given in Stakgold [618, 619]. Consider a Sturm- 
Liouville operator which is singular at a and b: 


Lf == (-(pf') +af), a<x<b. (2.27) 


Here “singular” means that either the interval is infinite or the functions w(x) or 
q(x) blow up, or p(x) vanishes at some point in [a,b]. The Hilbert space associated 
to (2.27) is L?({a,b],w) consisting of Lebesgue measurable functions f on [a,b] 
such that 


b 
[ Were dx <. 


The inner product for this weighted L?-space with weight w is 


(Fa) = [sew dx 


If one makes the correct assumptions about p,q, w and if one imposes the correct sort 
of boundary conditions at a and b, the operator L will be self-adjoint. In particular, 
p,q,w should be real and w should be positive. 

The Green’s function G(A;x,y) of formula (2.24) must satisfy 


Pick c in (a,b). We want to choose two linearly independent solutions p} and y, 
of Lu = Au, as follows. Suppose that p} solves Lu = Au and in addition @, lies in 
L?([a,c],w). Let y solve Lu = Au and y} € L’([c,b],w). Then 


ay 1 Samo), a<x<y<b 
On Po a<y<x<b A 


and 
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Example 2.2.1 (Spectral Measure for the Hankel Transform Using the Kodaira- 
Titchmarsh Formula). Consider the Sturm—Liouville operator given by formula 
(2.23). Then formula (2.28) becomes 


YAl2y), Ocx<y<eo 
D(qt/2x), O<y<x<oe. 


Jy (A'/2x)H 
Jy (al?) 


Ti 


G(A3x,y) = a 


<><>” 


Here Jy is the Bessel function of the first kind from Exercise 2.2.2 and HP is the 
Hankel function (or Bessel function of the third kind) defined by 


J_y(z) — exp(—vri)Jy(z) 


H! 
isin(vz) 


ala) = 


when v is not an integer. Take limits as v — n to define HY) when n is an integer. It 
follows from Lebedev [401, pp. 112-113], that the determinant c} = —2i/z in this 
case. 

In order to check this formula for G(A;x,y), one needs the following asymptotic 
properties of the Bessel functions from Lebedev [401, Chap. 5], (v > 0): 


Jy(x) ~ (3) sae as x > 0, 
Jy (x) ~ Z cos (x— YE- F), as x— ©, 
(1) LFY (2.29) 
Ay (x) ~ —4 (4) T(v), as x0, 
HP (x) ~ Žexp|i(x — 4— 4)], as xX — œ 


Note that if 0< v < 1, bothJy(A!/2x) and HP (A '/2x) are in Z? ([0, 1], x). One says 
that 0 is then in the limit-circle case in Weyl’s theory (cf. Stakgold [618,619]). If v > 
1, then Jy(A!/2x) is in L2({0, 1],x) and HS) (A!/2x) ¢ L2({0, 1],x). One says that 0 
is then in the limit-point case. Note that Jy(A!/2x) ¢ L? ([1,%],x) and HS) (Al/2x) € 
L? ([1,%],x) so that infinity is always in the limit-point case. 

To compute the jump of G(A;x, y) required by formula (2.26), assume that x < y. 
Then for v > 0, 


G(A + i0;x,y) — G(A — i0;x,y) 
= i (Jy(Al xe (A1/2y) — Jy(-4 12x) HEP (-42y)) . 


So we need the functional equations 
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Jy (—x) = exp(vzi)Jy (x), 
HP (—x) = exp(—vzi) (Hw) = 2Jy(x)) . 


These formulas imply 
G(A + i0;x,y) — G(A — i0; x,y) = indy (4 1P xJ (Aly). 
Thus we obtain from (2.24) and (2.26), 


ô(x—y) 


Xx 


1 se és 
=5 Jy(A¥2x) J, (41y) dy = f koan onpi. 


This is Hankel’s integral theorem. 


For higher-rank symmetric spaces, the method just used to compute the spectral 
measure proves rather unwieldy because the Green’s functions are much more 
complicated. Thus we will formulate another method in Chap.3 and Volume 
II [667]. For this method, one needs only the asymptotics and functional equations 
of the eigenfunctions appearing in the spectral expansion of the operator L, rather 
than the expression for the resolvent kernel, involving other eigenfunctions as well. 
The method is due to Harish-Chandra and it would be very interesting to derive 
Harish-Chandra’s Plancherel measure from the spectral theorem in a similar way to 
that which gives formula (2.26). 

One can also ask for the spectral decomposition of the operator in (2.23) on 
the finite interval (0,a). The problem still has a singularity at 0. But on (0,a), the 
spectral resolution of the operator L in (2.23) is given by a series rather than an 
integral, when v > —1: 


f(x) = x CmJv(Oymx/a), O<x<a, v>-l, (2.30) 
m=1 


where {Om }m>1 is the set of all positive roots of Jy (Gym) = 0 and 


bm = 2a p44 (Cyn 2 f E A Ora 


The Fourier—Bessel series expansion (2.30) is derived in Titchmarsh [679, Vol. 1, 
pp. 81-86]. See also Stakgold [619, pp. 305-308, 313-315]. It is quite interesting 
to let a approach infinity in (2.30) and watch (2.30) approach (2.22) (cf. Morse and 
Feshbach [479, Vol. I, pp. 762—766]). 


Exercise 2.2.8 (The Kontorovich—Lebedev Transform). Consider Bessel’s equa- 
tion with the role of the parameters interchanged: 


—(xw')' + pxw—-Aw/x=0, 0<x< œ. 
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Here u > 0 and À is the eigenvalue. Show that the Green’s function is 


1/2 1/2 


2 


y), 0<x<y<~m, 


I ygt x) K_ ya(u 
, — ivVÀ iva 
G(A;x,y) x), 0<y<x<e, 


I allt 1/2y)K_ z(u 1/ 


where J and K are the Bessel functions of imaginary argument. The definitions 
are 


I,(z) = exp(—vmi/2)Jy(zexp(mi/2)), —m <argz< 2/2, 
os 
Ky(z) = 5 exp(vti/2)Hy" (zexp(zi/2)), —m <argz< 7/2. 
See Chap. 3 and Lebedev [401] for more information on these functions. 


Show that K_y(x) = Ky (x) and J_y(x) —I)(x) = 2sin(vz)Ky(x)/7. Then prove 
the Kontorovich—Lebedev inversion formula: 


Bee 5 [  Kiy (Vx) Kiv (\/y) vsinh (zv) dv. 


This will be a central result in Sect. 3.1. 


2.2.2 CAT Scanners and the Radon Transform 


Modern X-ray scanners can reproduce the tissue density function f(x), x € RÊ, for 
a plane slice of a person’s head, for example. They operate by inverting the Radon 
transform, defined for k € R, u € S', by 


Rf(k,u) =| 


xu=k 
xER- 


f(x) ds, (2.31) 


where the integral is over a line in the plane and ds is the element of arc length. The 
inversion formula for this transform goes back to Radon [528] in 1917. It says that 


If 1 /1 ; 
fal=—zf aa (= [Ria Xu, u) au). (2.32) 


Here the integral over q in R* can be viewed as a Stieltjes integral or as a Cauchy 
principal value integral, assuming that f is continuous with compact support. 
References for the Radon transform include Bracewell [62], Dym and McKean 
[147], Gelfand, Graev, and Vilenkin [204], Herman [290], Helgason [279], Louis 
(431, 432], Ludwig [435], Elena Prestini [522], Quinto [524], and Shepp and 
Kruskal [588]. In fact, Funk [187] proved the analogue of (2.32) for S? rather than 
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S! one year earlier than Radon proved his result. Helgason [279] shows that a vast 
generalization of this theory is possible, viewing the Radon inversion formula as 
involving two dual integrations—one over the points in a hyperplane and the other 
over the hyperplanes through a point. The Radon transform appears in so many 
applications to signal processing that Matlab’s Signal Processing Toolbox has a 
Radon transform command. We look at finite analogues of the Radon transform 
in [668, pp. 71-72] 

The next sequence of exercises presents a derivation of Radon’s inversion 
formula. 


Exercise 2.2.9. (a) Suppose that f : R? > C is a Schwartz function. Show that the 
Fourier inversion formula can be written in the form 


fx) =f EfCux,u) du, 


ues! 


where 
Ef(t,u) = ; | 7 1 _ „ IRf(K,u)exp[2zir(t — 5) dk dr 


In the first formula du denotes the angle measure on the unit circle S!. The 
Radon transform Rf (k,u) is defined by (2.31). 

Let abs(r) = |r|, r € R. Show that abs(r) is not the Fourier transform of a 
Lebesgue integrable function. 


(b 


wm 


Jee 


Hint. Recall the Riemann—Lebesgue lemma. Note that if abs(r) were a function, 
then you could write: 


Ef(t,u) = ; i . abs(t — K)R f (k,u) dk. 


Exercise 2.2.10 (Derivatives and Fourier Transforms of Some Distributions). 
Define the Cauchy principal value integral by 


PV ( / fx) dx) = lim ( I... f(x) ax) 
Define 
O!,@) =PV | o(x) dx 
and 


0, p) =PV [x (p(x) - 9(0)) dx 
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for test functions @ as in Sect. 1.1. Prove that in the sense of distributions (see 
Sect. 1.1) the following formulas are valid: 


(a) x7! = (log |x|)’. 
(b) eS) 


(c) x7! = —in sgn(x), where sgn(x) = abs(x)/x. 
(d) abs(x) = —(22?)-!x-?. 
Hint. See Vladimirov [706, pp. 75, 86, 134], or Bracewell [61, p. 130]. 


Exercise 2.2.11. Derive Radon’s inversion formula from Exercises 2.2.9 
and 2.2.10, using properties of Fourier transforms of distributions. 


Note. The Hilbert transform of a function is 


Hf(x) = f OET L f(m). 


TJ- (x—t) 


Thus part (c) of Exercise 2.2.10 implies that 


Hf (x) = i sgn(x)f. 


Because Radon’s inversion formula (2.32) contains a derivative, it does not 
appear to be as useful for numerical calculation as the formula of Exercise 2.2.9. 
So those that have computed these transforms in practice have used approximations 


to abs(x) (cf. Herman [290, pp. 19ff], and Shepp and Kruskal [588]). 


Exercise 2.2.12. Compute the Fourier transform of the following approximation to 
abs(x) for x € R?: 


foy= {bh Esa 


0, otherwise. 


Compare the decay at infinity with that of |x|~* from part (d) of Exercise 2.2.10. 


Note. You can write the Fourier transform in the plane as a Bessel transform, much 
as we did in Theorem 2.2.1 for the Fourier transform in 3-space. See Exercise 2.2.5. 


Some History. In 1956, R. Bracewell used a method analogous to the Radon 
transform to study solar radiation. In the 1960s and 1970s CT scanners for medicine 
were developed independently by A. M. Cormack and G. N. Hounsfield. They were 
not based on the Radon transform. The Hounsfield algorithm was used in the first 
commercial CAT scanner made by EMI Central Research Labs. in the UK. The first 
patient brain scan was done in 1971. A. M. Cormack and G. N. Hounsfield shared 
the Nobel Prize in Medicine in 1979. 

The four illustrations in Figs.2.6 and 2.7 from the 1978 paper of Shepp and 
Kruskal [588] show a mathematical phantom on the bottom in Fig. 2.6 representing 
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Fig. 2.6 On the bottom is a simulation of a human head using 11 ellipses. On the top is a 
reconstruction using the algorithm embodied in the first commercial machine (EMI Ltd.) from 
180 x 160 strip projection data obtained by exact calculation from the image on the bottom. 
(From Shepp and Kruskal [588, pp. 422-423]. Reprinted by permission of American Mathematics 


Monthly) 
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Fig. 2.7 On the bottom is a reconstruction from the bottom image in Fig. 2.6 using the Fourier 
based algorithm of Shepp. On the top is a reconstruction using the algorithm used in the 1970s 
EMI machine from 180 x 239 strip projection data obtained by exact calculation from the bottom 
image in Fig. 2.6. (From Shepp and Kruskal [588, p. 424]. Reprinted by permission of American 
Mathematics Monthly 


a slice of a human head and reconstructions of this phantom by three different 
algorithms. The Hounsfield algorithm was used to produce the top part of Fig. 2.6. 
The bottom part of Fig. 2.7 was produced with a Fourier-based algorithm of Shepp. 
The top part of Fig.2.7 was produced with an algorithm used by EMI Ltd. in the 
1970s. Clearly progress was made. 
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A description of the progress made between the 1970s and 1980s is contained in 
the following quotation from Science 214 (1981), p. 1327: 


When CT scanners first became commercially available about 8 years ago it took 5 minutes 
to scan a patient’s head and 5 minutes for each computerized reconstruction of an image 
from the x-ray data. Now, because of advances in the design of the scanners and in computer 
technology, the newest machines can scan a head just 10 seconds and can reconstruct an 
image virtually instantaneously. According to Jay Thomas Payne of Abbott Northwestern 
Hospital in Minneapolis, the Mayo Clinic’s first CT scanner, which is only 5 years old, has 
been relegated to the clinic’s historical museum. 


Of course, despite the progress in speed and accuracy of CT scanners, they still 
expose people to radiation. In a New York Times article from August 21, 2012, Jane 
E. Brody wrote the following. 


But it [radiation] also has a potentially serious medical downside: the ability to damage 
DNA and, 10 to 20 years later, to cause cancer. CT scans alone, which deliver 100 to 
500 times the radiation associated with an ordinary X-ray and now provide three-fourths of 
Americans’ radiation exposure, are believed to account for 1.5 percent of all cancers that 
occur in the United States. 


Thus nuclear magnetic resonance tomography (NMR alias MRI) may be destined 
to be the tomography of the future. It uses magnetic fields rather than x-rays and thus 
is presumably less damaging to the body. Louis [431,432] considers applications of 
the three-dimensional Radon transform to NMR tomography. Spherical harmonics 
are used to improve the algorithm by studying the kernel (i.e., inverse image of 
0) of the transform (ghosts). In 2003, the Nobel Prize for Physiology or Medicine 
went to Paul Lauterbur and Sir Peter Mansfields for their work on MRI. See Elena 
Prestini [522], Chap. 8, for more details on the subject. She begins the chapter 
with a quotation from a New York Post article from 1939 about a talk by I. I. Rabi 
explaining why NMR scanning works. The quote is: “We are all radio stations.” 

There are many other applications of the Radon transform; for example, in radio 
astronomy (see Bracewell’s article in Herman [290, pp. 81—104]). And there are 
applications to partial differential equations; e.g., to find solutions of the wave 
equation (see Dym and McKean [147, pp. 137—139], and Helgason [279]. 

This concludes our discussion of harmonic analysis on the sphere. It would be 
possible to consider spherical analogues of many more results from Chap. 1. We 
shall leave this to the interested reader. For example, the group of isometries of the 
sphere is just O(3). A discontinuous subgroup I C O(3) has the property that any 
domain in the sphere can contain only finitely many points equivalent under I to 
any given point. There are very few such discontinuous T for the sphere or elliptic 
plane. They correspond to the regular polyhedra (see Hilbert and Cohn- Vossen [297, 
p. 242]). These are the analogues of the space groups considered in Sect. 1.4. It is 
also possible to discuss the analogue of the Poisson summation formula for T\ S? S 
T\G/K, G=SO(3), K = SO(2). We shall not do this here, since our main interest 
is the Selberg trace formula for noncompact fundamental domains (see Chap. 3). In 
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fact, we never examined the Selberg trace formula when G is the Euclidean group 
either (but see Hejhal [265]). 

There is work on fast computation of Fourier transforms on the sphere by Driscoll 
and Healy [140], for example. There are also finite analogues of Radon transforms 
(see [668]). 


Chapter 3 
The Poincaré Upper Half-Plane 


But why then, this mystical set-up of putting the definition before the proof? 
—From Lakatos [384, p. 154]. 


3.1 Hyperbolic Geometry 


... la géométrie non-Euclidienne. .. est la clef véritable du problème qui nous occupe.! 


—From a letter of Poincaré to Klein in 1881 appearing in Acta Math., 39 (1923), p. 100. 


3.1.1 The Basics: Arc Length, Area, Laplacian 


In Chap. 2, we considered a model for elliptic geometry, in which any two geodesics 
intersect, so that there are no parallels. Now we want to investigate a model for 
hyperbolic geometry, in which there are infinitely many geodesics through a given 
point which are parallel to a given geodesic. This sort of geometry was discovered 
by Bolyai, Gauss, and Lobatchevsky in the 1820s. However, Gauss never published 
his results, perhaps because the idea was controversial. In fact, Gauss embittered 
Bolyai by claiming precedence in a letter to Bolyai’s father (see Gauss [199, Vol. 8, 
pp. 220—225]). The subject of non-Euclidean geometry was still controversial when 
Lewis Carroll “repudiated hyperbolic geometry in 1888 as being too fanciful” (see 
the article of Coxeter in COSRIMS [110, p. 55]). Models for hyperbolic geometry 
were obtained first by Liouville, then by Beltrami in 1868, and by Klein in 1870. 
Poincaré rediscovered the Liouville—Beltrami upper half-plane model in 1882 and 


'Non-Euclidean geometry...is the real key to the problem that concerns us. 


A. Terras, Harmonic Analysis on Symmetric Spaces—Euclidean Space, the Sphere, 149 
and the Poincaré Upper Half-Plane, DOI 10.1007/978-1-4614-7972-7_3, 
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this space is usually called the Poincaré upper half-plane, though some call it 
the Lobatchevsky upper half-plane (but see Milnor [469]). Poincaré [517] also 
considered discontinuous groups of transformations of the hyperbolic upper half- 
plane as well as the functions left invariant by these groups and we intend to do the 
same. The geometric foundations for such work were laid by Gauss in 1827 (see 
Gauss [199, Vol. 4, pp. 217—258]) and by Riemann in 1854 (see Riemann [542, 
pp. 272—287]). These important papers of Gauss and Riemann are discussed from a 
modern perspective in Spivak [615, Vol. II]. 

Further progress was made from the 1880s to the 1930s with the investigation 
of Lie groups and symmetric spaces by Lie, Cartan, and others. Ultimately, non- 
Euclidean geometry has become about as controversial as \/—1 = i. Like complex 
analysis, analysis on Lie groups and symmetric spaces such as the Poincaré 
upper half-plane has become an indispensable tool for modern work in physics 
and engineering. We will see many examples, including work on the design of 
microwave transmission lines and the computation of the electrostatic field due to 
a thin charged conductor in the shape of the surface of two intersecting spheres. 
There are many applications to quantum physics in Gutzwiller [239-242], and Hurt 
[308,309]. Some references for this section are Auslander [17], Beardon [31], Buser 
[74], Dym and McKean [147], Hilbert and Cohn-Vossen [297], Svetlana Katok 
[342], Maass [437], and Siegel [596]. 

We shall see that the Poincaré upper half-plane is the symmetric space of the Lie 
group SL(2,IR), the special linear group of all 2 x 2 real matrices with determinant 
1. This group has made many appearances in number-theoretic investigations, going 
back to work of Gauss and others on quadratic forms with integer coefficients. The 
most recent appearance is in the proof of Fermat’s conjecture by A. Wiles with the 
help of R. Taylor. We will say a little more about this in Sect. 3.4. 

Now SL(2,R) is isomorphic to the group SU(1,1) of 2 x 2 complex matrices 
of determinant one which preserve the hermitian form —z,Z; + z2Z2. This means 
that the corresponding symmetric space can also be viewed as the unit disc (see 
Sect. 3.1.2 with its applications to electrical engineering). The group SL(2,R) is 
locally isomorphic to the Lorentz-type group SO(2,1) of real 3 x 3 matrices of 
determinant 1 preserving the quadratic form x +x — x, a group whose analogue 
SO(3,1) is quite important in physics, since it leaves Maxwell ’s equations invariant 
(see Exercise 3.1.13). 

The Poincaré or hyperbolic upper half-plane H is defined by 


H ={x+iy|x,y¢€R,y>0}, where i=V—1, (3.1) 
using a new notion of arc length given by 
ds’? =y? (dx + dy’). (3.2) 


This arc length is not “too fanciful,” because it is invariant under the action of g in 
SL(2,R) on z in H defined by fractional linear transformation: 
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gz = g(z) = (az+b)/(cz+d), (3.3) 


if = (27), with abode R, ad —be=1. 
Cc 


More information on fractional linear transformations can be found in Ahlfors [3], 
Siegel [596, Chap. 3], and almost any book on complex analysis. 


Exercise 3.1.1. Show that the map z > g(z) defined by Eq. (3.3) gives a conformal 
or angle-preserving mapping of H onto H. 


Hint. Note that Im(g(z)) = y\ez+d|~?, ifz=x-+iy, and g'(z) = (ez+d)~?. 


Exercise 3.1.2 (Group Invariance of the Riemannian Metric on H). Show that 
the Poincaré arc length element (3.2) is invariant under the action (3.3) of g in 
SL(2,R). 


Hint. If w = f(z) is a holomorphic function and w = u + iv,z = x + iy, with 
u, v, x, y E€ R, then the Jacobian matrix of the change of variables from z to w is 


du du du du 

J= ox dy \ _ ox oy 
— dv av a du du |? 

ox dy Oy Ox 


by the Cauchy—Riemann equations. Now 


and the determinant of J is |f'(z)|?. In our case f(z) = (az + b)/(cz+ d) with 
a,b,c,d € R and ad — bc = 1. Thus, you easily compute that 


v=ylcz+d|~? and f'(z) = (cz+d)~”. 


The geodesics, or curves minimizing the Poincaré arc length in H, are straight 
lines or semi-circles orthogonal to the real axis. To prove this, we imitate the 
argument that works for Euclidean space and the sphere. It suffices to show that 
the positive y-axis is the curve minimizing distance and passing through i and yoi, 
where yo > 0, for we can find an element g of SL(2,R) that moves any two given 
points in H to i and iyo, for some yo > 0, by Exercise 3.1.3. Now let 


z(t) =x(t)+(t), a<t<b, 


denote any curve in H with z(a) =i and z(b) = iyo. Then the Poincaré length of this 
curve is (for yo > 1) 


b 1 yo 
| y?y?) a> | y dy = log yo. 
a 
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Fig. 3.1 The failure of Euclid’s fifth postulate. All geodesics through z outside the shaded angle 
fail to meet L 


And |logyo| is the Poincaré length of the segment of the y-axis joining i and iyọ. We 
will see that a similar argument works in the higher-dimensional symmetric spaces 
to be considered in Volume II [667]. 


Exercise 3.1.3. Show that given p,q in H, there is a matrix g € SL(2,R) such that 
g(p) =iand g(q) = iyo for some yo > 0. 


Hint. First, move p to i. Then note that k € SO(2) C SL(2,R) implies k(i) = i. 


We can use the Poincaré arc length to define the non-Euclidean distance 
between two points in H. This distance is the Poincaré length of the unique 
geodesic segment connecting the two points. The geometry obtained by considering 
geodesics to be the straight lines is called hyperbolic geometry. It is similar to 
Euclidean geometry in many respects. For example, two points in H determine a 
unique geodesic passing through them. And the non-Euclidean distance makes H 
a metric space. So, for example, the distance on H satisfies the triangle inequality, 
meaning that the length of one side of a non-Euclidean triangle is less than or equal 
to the sum of the lengths of the other two sides. 


Exercise 3.1.4. Show that two points in H determine a unique geodesic passing 
through them. Then show that the non-Euclidean distance satisfies the triangle 
inequality. 


Exercise 3.1.5. We can use the Poincaré metric, which is really an inner product 
on the tangent space to H at a point, to define the angle between tangent vectors 
or curves in H. Show that this notion of angle is the same as the Euclidean angle 
measure. 


But the hyperbolic geometry does not satisfy Euclid’s fifth postulate. For given 
a point z in H not on a geodesic L, there are infinitely many geodesics through 
z which do not meet L (see Fig.3.1, in which all geodesics through z outside the 
shaded region fail to meet L). 

The group of all isometries of H is generated by the fractional linear transfor- 
mations (3.3) and z++ —z. Here isometry means Poincaré-distance preserving. See 
Exercise 3.1.12 for a proof. 
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We can now use the standard formulas from Riemannian geometry to write down 
the SL(2,R)-invariant area element and Laplacian (see formulas (2.2)-(2.6) and 
Auslander [17]). The SL(2,IR)-invariant area element on H is 


du =y dx dy. (3.4) 
The Laplace operator on H is 
o? o? 
Ay =A=y’ | =+  }. : 
H y (2 + 53) (3 5) 


Exercise 3.1.6 (SL(2,R)-Invariance of the Area Element and the Laplacian 
on H). Show that du in Eq. (3.4) and Ay in Eq. (3.5) are SL(2,R) invariant. 


Hint. Recall Exercise 3.1.2 and formulas (2.2)-(2.6). 


Exercise 3.1.7. Show that the non-Euclidean area of a hyperbolic triangle is 7 
minus the sum of the angles. 


Hint. You can prove this directly or note that this is a special case of the Gauss— 
Bonnet formula, since the Gaussian curvature of H is —\ (see Singer and Thorpe 
[604, pp. 175, 191-192], or Auslander [17, pp. 265, 268]). 


It is now possible to explain the term hyperbolic in at least two ways. The term 
is due to Klein and comes from the Greek word hyperballein meaning to throw 
beyond. The first justification is that if two geodesic rays R; and R emanate from 
the ends of a geodesic segment perpendicular to them both, then the non-Euclidean 
distance between R; and R} will increase. The second explanation comes from the 
fact that the Gaussian curvature of H is —1 and thus H resembles a hyperboloid 
of one sheet or a hyperbolic paraboloid (e.g., z = x? — y* at the origin). References 
for this are Auslander [17], Flanders [180], Guggenheimer [235], and Hilbert and 
Cohn-Vossen [297]. We justified the term elliptic similarly in Chap. 2. 

There are other useful realizations of H. The most abstract of them is given in 
the following exercise. Compare it with Exercise 2.1.1 of Sect. 2.1. 


Exercise 3.1.8 (H as a Quotient or Homogeneous Space). Show that H can be 
identified with G/K where G = SL(2,R) and K = SO(2). 


Hint. Map G/K one-to-one onto H by sending the coset gK,g € G, to g(i) = the 
image of i= \/—I under the fractional linear transformation (3.3) corresponding 
to g. 


For connecting Chap.3 and Volume II, [667] the most useful realization of H 
is as the space of positive definite binary quadratic forms of determinant 1 or 
what amounts to the same thing: 


SP2= { Y € R??? | Y positive definite symmetric of determinant 1} . (3.6) 
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A 2 x 2 symmetric matrix Y with real coefficients is positive if Y [x] = ‘xY x > 0 for 
every nonzero column vector x in R?. See Volume II [667] for more information 
about positive matrices. Note that the action of g in SL(2,R) on Y in SP3 is given 
by 


Y > Y(g] = ‘gY g. (3.7) 
We shall use left G-actions here in Chap. 3 and right G-actions in Volume II [667]. 


Hopefully this will not cause too much confusion. 


Exercise 3.1.9 (H as a Space of Positive Quadratic Forms of Determinant 1). 


(a) Show that the following maps are identifications and preserve the group action 
of SL(2,R) on the three homogeneous spaces: 


K\G > SPa > H 


Ke "gg = Proce H with p|] =0, 


where P[A] = ‘APA for any suitable matrix or vector A, as in formula (3.7). 
Here K\G is used to denote the homogeneous space of right cosets Kg, g € 
G = SL(2,R), K = SO(2). Note that you can write P in SP2 as follows: 


Ze) el reer 


with x,y€R, y>0. 


This is clearly possible. Then P | 4 = 0 for z € H implies z = x + iy. 


(b) Show that a geodesic-reversing isometry of SP3 at the identity I is Y =œ Y~!. 
1 


The corresponding mapping in H is z => —z. 
Given P € SP , the spectral theorem for self-adjoint operators on finite- 
dimensional Hilbert space says there is a matrix k in SO(2) such that P[k] = ‘kPk 
is diagonal with first diagonal entry t > 0 and second entry 1/t. We can think of k 
and ź as polar coordinates for P. If you want to use geodesic polar coordinates, you 
would use k and | logt|. This amounts to defining geodesic polar coordinates (r, u) 
ofz = x+ iy in H by 


ee E aa 


x = ysinhr sin (2u), y = (coshr + cos (2u) sinhr)~!. (3.8) 
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Fig. 3.2 Coordinate grid for geodesic polar coordinates 


As u runs from 0 to 27 and r from 0 to infinity, the upper half-plane is covered 
once. To see this, note that the eigenvalues ¢ of the positive definite matrix 
P € SP» corresponding to z as in Exercise 3.1.9 are uniquely determined up to 
order. Formula (3.8) fixes that order by requiring that the first eigenvalue be the 
smaller one. 

Let Kp denote the subgroup of k € SO(2) preserving a diagonal matrix, P € SP2, 
under the action P[k] = 'kPk. This subgroup Kp has order 2 if P is not a scalar times 
the identity matrix. If P Æ cl, for c > 0, the subgroup Kp does not affect elements of 
H at all. 


Exercise 3.1.10. Show that in formula (3.8), r is the Poincaré distance between z 
and i = \/—1. Then show that the orthogonal grid in H created by the curves u = 
constant and r = constant consists of circles and semi-circles, such as those pictured 
in Fig. 3.2. In particular, 


{k(e~i) | r > 0} = semicircle through i orthogonal to thex—axis, 


{k(e~"i) |k € SO(2)} = a circle with center icoshr and radius  sinhr. 


Exercise 3.1.11 (Changing Variables in Geodesic Polar Coordinates). Show 
that in geodesic polar coordinates (r,u) as in Eq. (3.8), we have 
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ds? = dr? + (sinhr)*du’, 
du = y~7dx dy = sinh rdrdu 


ees ete ee a 1 2? 
A= sinhr or (sinh) (sinhr)* 9u? ` 


Use formulas (2.2)-(2.6). 
Another realization of hyperbolic geometry is the unit disc: 
U={zEC | k|<1}. (3.9) 
The Cayley transform 
zew=(z—-i(z+i) (3.10) 


maps H conformally onto U. See Siegel [596, Chap.3], Maass [437, Chap. 1], 
and Helgason [276, pp. 4-7], for more information on this model. The group of 
orientation-preserving isometries of U is SU(1,1) which is the group of matrices 


g € C**? such that 
i-f{1 0\  /1 0 
PL oatye Lasts 


g= Ce with a,b € C and la? — |b? = 1. 
a 


This version of hyperbolic geometry is much used by electrical engineers, as we 
shall see. 


Such g have the form 


Exercise 3.1.12. Use Schwarz’s lemma from complex analysis (see Ahlfors [3, 
p. 135]) to show that the set of all orientation-preserving isometries of the unit disc 
U is the group SU (1,1). Show also that SU (1,1), together with the isometry given 
by z > Z, generates all the isometries of U. Then use the Cayley transform from 
formula (3.10) to translate this result to H. Finally, characterize the geodesics in U, 
using the appropriate SU (1, 1)-invariant arc length. 


3.1.2 Microwave Engineering: The Smith Chart 


Microwave engineers deal with electromagnetic waves at high frequencies and short 
wavelengths from 1m to 1mm (see the chart of the electromagnetic spectrum 
appearing in Fig. 1.8). Different books have slightly different numbers for this 
definition of microwaves. World War II saw the first use of microwaves in order 
to detect planes and ships via radar. Now there are numerous applications to such 
diverse areas as astronomy, communications, cooking, location of speeders on the 
highways, nuclear physics, GPS, satellite TV, weather prediction, wireless local 
area computer networks, motion detectors, cell phones. Non-Euclidean geometry 
comes into the design of microwave transmission lines because the basic quantities 
of interest are related by fractional linear maps (or higher-dimensional analogues). 
Here we consider only a very simple example in which power transfer is increased 
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on a lossless transmission line by connecting short-circuited stubs to the line. 
Further information on microwave theory can be found in the books of Altman [5], 
Baden Fuller [20], Collin [99], Helszajn [285], Hewlett-Packard [293], Lance [385], 
Magnusson [447], Pozar [521], and Staniforth [620], for example. See Border [50] 
and Helton [286-288], for a general treatment of power transfer problems involving 
more complicated circuits with waves of varying frequency. 

Here we shall demonstrate that we are not engineers by using i=/—1 and not j. 
We shall also write z for the complex conjugate of z and not z*, as the engineers do. 

At short wavelengths, electronic circuits behave differently than they do at long 
wavelengths which are larger than the dimensions of the circuit under consideration. 
For example. the circuit parameters vary with position and radiation becomes 
important. If a voltage wave V* exp(—iBd) with current I* exp(—iBd) arrives at 
the end of a lossless transmission line, a reflected voltage wave V~ exp(iBd) with 
current —J~ exp(iBd) is produced. Here the phase constant is B = 27/4, if A = 
wavelength, and d measures distance along the line. If the load is at d = 0, Vz = 
Vt? +V, IL =I* —I-. The load impedance is Z; = V/I; and the characteristic 
impedance is Ze = V'/I* = V~/I~. Define the voltage reflection coefficient 
T =V /V* and the normalized load impedance z = Zz /Z,. It follows that 


VF =(z—1)(¢+1)"! andz=(14+T)(1-Fr)t. (3.11) 


The load is said to be matched with the line when T = 0. This is the condition 
for maximizing power transfer because the power flow 


P=V+T -VT =|vtP{1-|P)P?}/Z. 
These definitions extend to give the reflection coefficient at position —d: 
T(d) = [V~ exp(—iBd)|[V* exp(iBd)|~' =Tzexp(—2ißd). (3.12) 


Then the normalized input impedance (seen looking toward the load) at 
position —d is 


Zin = (1+T(d))(1—T(a))71. (3.13) 


For lossy lines, [(d) = Tz exp(—2iBd —2ad). 

Now |I| < 1 and the mapping Eq. (3.13) from zin to F is a slight variation of 
the Cayley transform just discussed. Write zin = R + iX, where R = input resistance 
and X = input reactance (both normalized). Plotting the image of the lines R = 
constant and X = constant in the T-plane gives the picture in Fig. 3.3. This sort of 
graph is called a “Smith Chart” because it was proposed by P. H. Smith at Bell 
Telephone Labs in 1939. However, Matsumoto [457, p. 41], reports that Mitzuhashi 
had discussed such a chart in 1937. A copy of the actual graph paper (once available 
in any university bookstore and now available in Matlab and Mathematica) is given 
in Fig. 3.4. Note that a movement a distance d along the transmission line changes 
T by exp(—2id); i.e., the point on the Smith Chart rotates by an angle 26d. 
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Fig. 3.3 Constant R and X circles in the reflection-coefficient plane. (From Collin [99, p. 205]. 
Reprinted by permission of McGraw-Hill Book Co.) The bars over letters in this figure do not 
mean complex conjugate. Collin calls the normalized load impedance Z instead of z and writes 
Z=R+ix 


Next let us consider an example of the use of the Smith chart from Collin [99, 
pp. 212-215]. The general idea is to see what various network elements do to a 
given point T. The goal is to finish with T = 0 so that load is matched. First we need 
a few more definitions. Let the normalized input admittance 


Yin = zp = (1-T)(1 +T)! = G+ iB, 


where G = conductance and B = susceptance. 

A stub is a short length of transmission line with a short circuit at its end. 
Consider the double-stub tuner pictured in Fig. 3.5. Let P) on the Smith chart in 
Fig. 3.6 correspond to yz = Gz + iB, = the normalized load admittance. The first 
stub at the plane aa in Fig. 3.5 adds a susceptance iB; which moves P; along the 
G = constant circle to P in Fig. 3.6. Then we get to the second stub at the plane 
bb in Fig. 3.5 by moving in the I -plane from P, to P; in Fig. 3.6, through the angle 
2Bd = 4rd /À, clockwise. The point P4 is obtained by moving along G = constant. 
In order to make P4 = 0 and achieve matching, the point Ps must lie on the circle 
G = 1. You can figure out graphically the value B, must have to do this. In fact, 
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Fig. 3.6 (a) Representation 
of the operation of a 
double-stub tuner. Figure 
continued in Fig. 3.7 (From 
Collin [99, pp. 213]. 
Reproduced by permission of 
McGraw-Hill) 


there may be two solutions or no solution, depending on the value of Gz and d. See 
Figs. 3.6 and 3.7 from Collin [99, pp. 213-214]. It turns out that one can always 
obtain matching with three stubs at fixed positions (see Collin [99, pp. 214—217]). 

Note that z or y lie in the right half-plane, rather than in the Poincaré upper half- 
plane H. To put y in H multiply it by i: 


y=G+iBH iy=—B+iG. 


Then the first stub in the double-stub tuner transforms iy to iy + Bı. This means that 
the stub corresponds to the following matrix in SL(2,R): 


1B, 
01/7 
In general, two-port circuits correspond to 2 x 2 matrices. The terminology does 
not appear to be totally standardized. But we shall describe the correspondence 


given in Helton [286]. The two-port circuit has the diagram shown in Fig. 3.8. For 
a linear two-port circuit, the impedance matrix Z € C22 satisfies ZI = V, where 


[= i and V = vi 4 
h V 


In this section we shall use L rather than Z as the 2 x 2 identity matrix. The 
scattering matrix for the circuit is S = (L—Z)(L+Z)~!. And the chain matrix 
y (the Russian letter cha or che) is defined by 


y YW\ (MN 
—h) Waj 
The power consumption is P = ‘VI = the hermitian product of the two vectors. 


The admittance matrix is Y = Z~!. In Fig. 3.9 we list the chain matrices for some 
common two-ports. 
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Fig. 3.7 (b) Determination 
of required susceptance for 
the first stub in a double-stub 
tuner; (c) Range of load 
impedance that cannot be 
matched when d = 1/8. 
(From Collin [99, 

pp. 213-214]. Reproduced by 
permission of McGraw-Hill) 


Fig. 3.8 A two-port circuit 
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Series 


EI Parallel 


(3) 


Transformer 
Gyrator 


je WO 


kin 0 . 0 r 
0 nik *\ -1/r 0 


Transmission line 


length d Crossed wires 
——— 
d 
cosh r(w) Zi sinh P(w) ( 1 e) 
—i sinh P(w)/Z, cosh P(w) 01 
Fig. 3.9 Chain matrices for some common two-ports. (Based on an illustration in Helton [286], 
by permission of the American Mathematical Society) 


3.1.3 SL(2,R) as a Lorentz-Type Group 


There is yet one more realization of the group SL(2,R) that we should mention. This 
connects the group with Lorentz-type groups that are of great interest in physical 
applications. The connection is developed in the following exercise. 


Exercise 3.1.13. Let 


w= [xioa = cs x ) 


X2 X07% 


weRosis2h, 


and eo = X(1,0,0), ey = X(0,1,0), e2 = X (0,0, 1). For each g in SL(2,R), let s(g) 
denote the matrix of the linear transformation of W given by sending x in W to 
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g x 'g, using the basis {e9,e1,e2}. Show that s : SL(2,R) + G,(2) is a continuous 
homomorphism onto G (2) = the connected component of the identity in the three- 
dimensional Lorentz group O(1,2). Prove that the kernel of s is {+7}. Here by 
kernel, we mean the inverse image of the identity. The Lorentz group O(1,2) is 
defined to consist of all real 3 x 3 matrices which preserve the quadratic form 


ge! 2 2 
Ox = x9 — xi — x5. 


Hint. Note that det(X (xọ,x1,x2)) = x — x — x2 is invariant under s(g). You can 
find more details in Sugiura [648, pp. 206—207]. 


Alternative Discussion of the Preceding Exercise. Consider a 2 x 2 matrix A. 
If e and ez are basis vectors and Ae; = a1;€1 + azje2, then we obtain the second 
symmetric power, denoted Sym? (A), by considering symmetric products (analogous 
to the alternating products appearing in the theory of differential forms): 


2: a 2.2 
Ae, Ae, = aĵe] + 2a11a21e1e2 +4515, 


22 2 2 
Ae: Ae = apne} + 2a12d22€ 1 €2 + 45265, 


2 2 
Ae, -Ae = ayy ay2e, + (a 1422 + a91412)e1€0 + a21d22€3. 


So the matrix of Sym? (A), using the basis e7, e1e2, e2, is: 
2 2 
ayy 411412 ay 


2a11421 411422 +4a21412 2a42a22 
2 2 
azy 421422 459 


This is a representation; i.e., Sym? (gh) = Sym? (g)Sym? (h). 


Next note that we can view SL(2, R) as the group Sp(1,R), the symplectic group 
of 2 x 2 real matrices preserving the alternating form J; i.e., 


f UD, f oiL 
E od n 


Now 
0 01 10 0 
Sym? (J) = | 0—10 | whichis equivalentto | 01 0 
1 00 00-1 


Here equivalence means by change of basis. 


m2 
So now we see that the representation Sp(1,R) = SL(2,R) 2 SO(2,1), 


provided that Sym?('g) = ‘Sym? (g). To make this true, replace Sym? (A) by the 
equivalent matrix obtained by multiplying the second row of Sym? (A) by 1/V2 
and the second column by V2. 
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3.2 Harmonic Analysis on H 


Wie man u.a. in den Werken von Euler und Gauss feststellen kann, haben in friiheren 
Jahrhunderten die Mathematiker nicht nur allgemeine Theoreme veröffentlicht, sondem 
dazu auch Beispiele, die ihnen wohl Vergnügen machten und den Leser weiter belehrten. 


Mir selber erscheint es allerdings nach den entscheidenden Ergebnissen von Gödel und 
Cohen geraten, mit dem ungehemmten axiomatischen Verfahren weiterhin vorsichtig zu 
sein. So hat mir schon immer die Anwendung des Auswahlaxioms widerstrebt, und ich 
fühle mich sogar jedesmal auf meinen Füssen sicherer, wenn ich auf zwei verschiedenen 
Wegen einsehen kann, dass zweimal zwei gleich vier ist. 


—From Siegel [600, Vol. IV, p. 33]. 


3.2.1 Introduction 


Complete sets of eigenfunctions of the non-Euclidean Laplacian 
A =y (d?/dx* +.0°/dy’) 


can be found by separation of variables in either rectangular or geodesic polar 
coordinates. For rectangular coordinates, the spectral resolution of A reduces to 
the inversion formula for the Kontorovich—Lebedev transform which was discussed 
in Exercise 2.2.8 of Sect. 2.2. This result was used by Kontorovich and Lebedev 
[367-369] in the late 1930s to solve various boundary value problems in math- 
ematical physics, such as Dirichlet’s problem for a wedge (see Exercise 3.1.8). 
For geodesic polar coordinates, the spectral decomposition of A boils down to the 
inversion formula for the Mehler—Fock transform. This is due to Mehler [462] 
in 1881 and was proved by Fock [181] in 1943. One motivation for studying 
the Mehler—Fock transform was the need to solve such physical problems as the 
computation of the electrostatic field due to a thin charged conductor in the shape of 
the surface of the region bounded by two intersecting spheres (see Exercise 3.2.23). 

After considering these two different reductions of the spectral decomposition 
of A to that of certain one-variable singular Sturm—Liouville operators, we shall 
consider another approach due to Helgason [276, 280]. This result will then be used 
to find the fundamental solution of the heat equation or heat kernel on H. 


> As one can see from the works of Euler and Gauss, mathematicians of previous centuries not only 
published general theorems but also examples which they might have enjoyed and which might 
have educated the reader. 


I believe that after the decisive results of Gödel and Cohen one should be cautious with 
unrestrained axiomatic techniques. I was always reluctant to use the axiom of choice and am always 
more comfortable when I have two different techniques to see that 2 x 2 = 4. 
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Then we shall discuss the central limit theorem on H, which was first proved by 
Karpelevich et al. [337]. This can be used to study the power reflected by random 
inhomogeneities in a long transmission line, as was first noted by Gertsenshtein and 
Vasil’ev [208]. Such work is related to investigations of Clerc and Roynette [92], 
Faraut [173], Heyer [294], Keller and Papanicolaou [349], Papanicolaou [509], and 
Triméche [685]. There are many applications of such central limit theorems; e.g., in 
demography, learning theory, and atomic physics (see Cohen [96], LePage [412], 
and Hantsch and von Waldenfels [248]). 

There are also many applications of the fundamental solution of the heat equation 
or heat kernel. Rosenberg [547] considers the heat kernel on compact Riemannian 
manifolds and applies it to Atiyah—Singer index theory. 

Thus, by the end of this section, we shall have discussed three versions of 
harmonic analysis on H, along with a few applications. In 1947, V. Bargmann 
essentially obtained the analogue of such Fourier decompositions for the group 
SL(2,R) itself (see Bargmann [25]). We will do little more than mention this work, 
along with a few of the implications for quantum mechanics, at the end of the 
section. 

References for this section include Dym and McKean [147], Gangolli [191-194], 
Helgason [276-284], Lang [389], Lebedev [401,402], Rühl [550,551], and Sugiura 
[648]. 


3.2.2 Harmonic Analysis on H in Rectangular Coordinates 


Harmonic analysis on H in rectangular coordinates x, y will be discussed first. These 
coordinates are the natural ones to use in the study of modular forms (automorphic 
forms for SL(2,Z)), as we shall see in later sections of this chapter. In order to 
carry out harmonic analysis on H in rectangular coordinates, one must be familiar 
with the eigenfunctions of the non-Euclidean Laplacian that transform nicely under 
translation. More precisely, we are seeking functions f(z) which we call k-Bessel 
functions associated to a € R and having the following properties: 


(i) f(z+u) =exp(2miau) f(z), forall we R,z€H, 
(ii) Af = Af, (3.14) 
(iii) |f(z)| < Cy? for some positive constants C, p. 


We call these functions “k-Bessel functions” because they are closely related to the 
classical K-Bessel functions (also called Bessel functions of imaginary argument, 
Macdonald’s functions, and modified Bessel functions of the third kind). 

In order to build up such functions, we make use of the simpler function f(z) = 
(Imz)* = y’, which we call the power function. It is easily seen that this power 
function is an eigenfunction of the non-Euclidean Laplacian; more precisely: 


Ay* = s(s— 1)y’. (3.15) 
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So now we define the k-Bessel function at s € C, z € H, a € R by 


+20 1 $ 
k(s|z,a) = I (im (=) exp(2miau) du. (3.16) 


u=— 0 


Here we assume that Re s > 0. 
Exercise 3.2.1 (K-Bessel Functions). 


(a) The K-Bessel function may be defined (see Lebedev [401] or Watson [721]) by 


K,(z) = zf e -4 («4 *)| 6! dt, (3.17) 


for Re z > 0. Show that if Re s > 0, 


T(s) Te +x)" exp(2irx) dx 


—oo 


S 


=l 
2T °K, 1 (Q\rq|), ifng#0 
lal (5) (s—4), ifr=0,q40. 


Hint. Use T (s) = Jy t7 te™, for Re s > 0, to see that 


T(s) Te +x?) exp(2irx) dx 


oo 


= L exp(2irx) D P(t) es dtdx 


co 


= L exp(2irx) f u’ ‘exp (-u(¢? +x*)) dudx. 
—co 0 
Then complete the square 
—ux? + 2irx —ug? =—u [(x-— ir/u)?+q°+ (r/u)?] : 


Now change variables via w = x — ir /u and use the fact that 


a exp(—w?°)dw = VT. 


—oo 


(b) Deduce from (a) that 


k(s|\z,a/m) = 2./mexp(2iax)I'(s)~! la| 2 9K, (2|ay|), (3.18) 


1 
z 
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when a 40, z= x+iy € H, Res > 0. Show also that k(s|z,0) = 
y (JT (6-4) P69). 
(c) Show that 


Ack(s\e,a) = s(s— 1)k(s|z,4) 
and 
k(s|z-+x,a) = exp(2miax)k(s|z,a). 


Exercise 3.2.2 (Asymptotics and Functional Equations of K-Bessel Functions). 


(a) Functional Equation. Show that K,(z) = K_;(z), if Rez > 0. 
(b) Asymptotics. Show that if s with Res > 0 is fixed and Rez > 0, then 


K,(z) ~ 25 !T(s)z-5, as z > 0, 


[T 
Ks(z) ~ PA as Zz > o%. 


(c) Show that if x > 0 is fixed, then 


= 
Kio) ~ yf Bem sin (2 + tlogt—t flog% ) as t — œ. 


Hint. For the first formula in part (b), you can use the first integral formula in 
Exercise 3.2.1 (a), after making the change of variables v = tz. For the second 
formula in part (b), see Lebedev [401, p. 123], or Olver [508]. For hints on part 
(c), see Exercise 3.2.9 of Sect. 3.7. 


Exercise 3.2.3 (The Negativity of A on H). 


(a) If Af and f are both in Z? (H), show that (Af,f) < 0, using Green’s theorem. 
Here (f,g) = f f(z)g(z)y* dxdy. 
(b) Show that, in fact, (Af, f) < —4 (f, f). 


Hint. This is done by Dym and McKean [147, pp. 277-278]. 


(c) Show that if s(s—1) < 0, then s = 5 +it, t € R, or s € [0,1]. Prove that if 
s(s— 1) <—4, then s ¢ [0,1]. 


Exercise 3.2.4 (Separation of Variables in Af = À f, Using Rectangular Coor- 


dinates). Seek solutions to Af = À f of the form f(x,y) = v(x)w(y). This leads to 
two ordinary differential equations: 
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where k is the separation constant. Then v(x) = exp(27iax) with k = 477a’. If we 
set w(y) = y!/2u(y), show that u satisfies 


1 
yu" + yu! — (iwas +(+ D) u=0. 


Then show that (assuming we seek solutions of at most polynomial growth) if 
a = 0, a solution is w(y) = y', A = s(s — 1), and if a £0, a solution is u(y) = 
Kı (2r|a|y), A = s(s— 1). See Lebedev [401, pp. 109-110], if you need hints. 
Theorem 3.2.1 (Fourier Inversion on H in Rectangular Coordinates). Suppose 
that f € C2(H); i.e., f is infinitely differentiable with compact support in H. Then, 
setting s = 5 + it, 


== f fla, S)ea s(z) tsinh (zt) dt da 
acRJRe s =} 


where 


exp(27iax) VYK,_1 (2z|aly), if a#0, 
as = z i i 
y5, if a=0, 


and 
f(a,s) =| £ z)€a,s(Z) y? dx dy. 


Proof. By the Fourier inversion formula of Sect. 1.2, this result is easily reduced 
to the inversion formula of Kontorovich and Lebedev [367-369] for functions 


h(y), y> 0: 
= =f t sinh (7t) Kil o SE udi 


a= ee Kir( ah CK) dudy, 


2 


(3.19) 


This is true because it suffices to prove the theorem when f(z) = v(x)w(y), if z= 
x+y. 

There are many ways to find the spectral measure 277~*t sinh (zt) in Eq. (3.19). 
Three ways are given in Exercise 2.2.8 of Sect. 2.2, Exercises 3.2.16 and 3.2.21 of 
this section. What we call the asymptotics/functional equations principle gives a 
simpler method. We shall not prove this principle rigorously here, but instead give 
a plausible argument for its truth. The method comes from work of Harish-Chandra 
and Helgason. There is a rigorous proof using Paley—Wiener theory in Helgason 
[280], for the special case under consideration. The generalization to symmetric 
spaces of semisimple Lie groups has been obtained. But no one seems to have 
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developed the principle as a general method in spectral theory. See Exercise 3.2.6, 
the discussion of the Mehler—Fock inversion formula (3.26), (3.27), and Volume 
II [667] for more examples of the asymptotics/functional equations principle. From 
Exercise 3.2.2 above, we believe that the asymptotic behavior of Kj; as y approaches 
0 is given by 


Ki) ~ 2" Ty 242" TR-n)y assy 0". (3.20) 


This can also be proved using the relation of K to the /-Bessel function and the 
power series for /. Note that the line of integration in the Fourier inversion integral is 
the line fixed by the functional equation of K; = K-s. And the two terms in Eq. (3.20) 
come from the functional equation. 

To prove Eq. (3.19), one wants to see that the kernel 


1 +R K; K; 
Wa(x,y) = a, + sinh (mt) SERED) ay 


Jy 


approaches 6(x— y) as R — ce. Since our problem is invariant under SL(2,R), it 
is reasonable to expect that it suffices to consider only x,y ~ 0. But then formula 
(3.20) and 


(it) (—it) = x(t sinh(at))~! (3.21) 


(see Lebedev [401, p. 3]) imply that 
1 fP ii -lyi + 
We) ~ =f y 2-"x "2 dt, x,y— 0%. (3.22) 
2m J-R 


And the right-hand side of Eq. (3.22) is a Dirac delta family by the Mellin inversion 
formula (see Exercise 1.4.1 of Sect. 1.4). So the spectral measure is chosen just to 
cancel out the gamma factors in Eq. (3.20). 

There are also discussions of the Kontorovich—Lebedev transform in Lebedev 
[402] and Sneddon [609, Chap. 6]. Note that one must show that the other terms 
in the asymptotic expansion (3.20) do not contribute to the inversion formula. One 
might also ask how one knows that there is no discrete spectrum. 


Exercise 3.2.5. Show that it does not matter which of the two inversion formulas 
in Eq. (3.19) you prove. 


Exercise 3.2.6. Can the asymptotics/functional equations principle be used to 
compute the spectral measure for the Hankel integral formula of Exercise 2.2.7(b) 
in Sect. 2.2? You want to show that: 


R 
f rJy(rx)Jy(ry)r dr~ d(x—y) as R—> æ, 
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Jy (x) | Zcos (x Mal T) as x > œ, V > : 
vit 4 ? ’ 3? 
and 

Jy(—x) = exp(vri)Jy (x). 


Exercise 3.2.7 (Some Kontorovich-Lebedev Transforms). 


(a) Show that the integral formula in Exercise 3.2.1 (a) implies 
Ki,(z) = f exp|—zcoshu]cos(tu) du, Rez > 0, tarbitrary. 
Use this to deduce that 
S exp(—zcoshu) = [ (z)cos(tu) dt, Rez>0. 
(b) Show that when Re(r) > |Re(s)|, 


[ x15.) ay =2°r se Ce 
0 2 2 


Again you can use the integral formula in Exercise 3.2.1(a). For more hints, see 
Exercise 3.6.4 of Sect. 3.6 (the section in which we apply this result). 


Note. You can find more examples in Erdélyi [165, Vol. II, pp. 125-153, 175-177], 
and Oberhettinger [499]. 


Exercise 3.2.8 (An Application of the Kontorovich-Lebedev Transform). Find 
the harmonic function in the wedge in Fig.3.10 taking on given values on the 
boundary of the wedge; i.e., solve 


Au(r,9,z)=0 in0<r<o, |o| <a, zER, 
u(r, —a, Z) = f(r,z), 


u(r,a,z) = g(7,2). 
Here (r,@,z) denote cylindrical coordinates in R? and f,g are given functions. 
Obtain solutions of 


u 1a 1 07u u 


A= I rar ! TE = 
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Fig. 3.10 A wedge 


of the form u(r, ¢,z) = exp(+ọ¢t + icz)R(r). Then R(r) satisfies 


R" +r 1R + (Pr? — eR = 0. 


One solution is Ky (r|c|). Use the Kontorovich—Lebedev transform to construct u 
from this. You can find the answer in Sneddon [609, pp. 363—366] or Lebedev [401, 
pp. 150-153]. 


3.2.3 Harmonic Analysis on H in Geodesic Polar Coordinates 


Next we consider harmonic analysis on H in geodesic polar coordinates (7,1) 
given by 


(3.23) 


z=xtiy=k,ei, with k= ( Ao a 


— sinu cosu 


In Sect. 3.1, after formula (3.8), we noted that as u runs from 0 to 27, and r runs 
from 0 to œ, the upper half-plane is covered once. Recall also that we computed 
ds*, du, and A in geodesic polar coordinates in Exercise 3.1.11 of Sect. 3.1. 


172 3 The Poincaré Upper Half-Plane 


In order to do harmonic analysis on H in geodesic polar coordinates, one must 
have some knowledge of the associated Legendre functions P“ Lyi These are 
-2 


essentially the eigenfunctions f of A on H which transform under k, defined in 
Eq. (3.23) according to the law 


f (kuz) = exp(2iau) f(z), zEH,uER. (3.24) 


Using the power function y* to construct functions satisfying Eq. (3.24), as we did 
in formula (3.16), we define the associated spherical function 


2m 


1 f 
h(s|z,a) = nh Im(k_;z)* exp(2iat) dt. (3.25) 


This turns out to be an associated Legendre function as the following exercise shows. 


If a= 0 in Eq. (3.25), we are looking at a zonal spherical function, invariant under 
rotation, which is a Legendre function. 


Exercise 3.2.9 (Associated Legendre Functions). The associated Legendre func- 
tion P“(z) can be defined for Re z > 0 (see Lebedev [401, p. 199]) by 


Tr 1) [27 s 
PH(z) = ef [z+ z— Icosu| exp(iau) du. 


Show that 


T(1—s) 


~ P” (cosh if z= ke'i. 
Td —s+2a) ““(coshr), if z e'i 


h(s|z,a) = exp(2iau) 
Hint. Note that Im(k,e~"i) = [coshr + cos(2u) sinhr]~!. 
Exercise 3.2.10 (Asymptotics and Functional Equation). Define the Legendre 
function by P, = P°. 


(a) Functional Equation. 

Show that if Re z > 0, then P_s(z) = Py_1(z). 
(b) Asymptotics. 

Show that if Re z > 0, Re s > —4, then 


1 T(s+4 l 
P (z) ~ A TUE ons as z — œ. 


Jn T(st+1) 


Hint. Use the integral formula in Exercise 3.2.9 and make the change of 
variables x = tan(u/2). That leads to the formula 


1” a= 2 = 1 
post f fer j (a 1) } Pa 
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This integral approaches 


2z)° 2z)" 1 1 
Cy | (x7 +1) dx = (22) B| =,st+=], as z>~%, 
T JxeR T 2 2 


where B(a,b) = the beta function =T (a) (b)/T (a+ b) (see Lebedev [40], 
Chap. 1]). 


Exercise 3.2.11 (Separation of Variables in Af = À f in Geodesic Polar Coordi- 
nates). Write f (kue "i) = v(r)w(u) with (u,r) = geodesic polar coordinates in H 
and solve Af = Af. You will obtain 


sinhr . . j , ; 23 
or) [sinhr v'(r)| —A(sinhr)* = k= aa 


where k is the separation constant. Set w(u) = exp(iau), a € Z. Then k = a” and v 
must be a solution of 


sinhr (sinhr v’(r))'— (A sinh? r +a”) v(r) =0. 
Set x =coshr, V(x) = v(r). Show that V must satisfy the ODE 
a 


2. 
ee a ae a (a+ :) V=0. 


Set A = s(s— 1) and obtain a solution V(x) = P“%,(x) (see Lebedev [401, p. 214]). 


The functions P* re (x) were first considered by Mehler [462]. They came to 
be called conical functions because they arise in physical problems involving cones. 
We call them spherical functions because they are a special case of the (associated) 
spherical functions attached to any symmetric space, examples of which will be 
considered in Volume II [667]. The Legendre functions can also be viewed as Gauss 
hypergeometric functions 2F; (see Lebedev [401, p. 165]). This gives the power 
series in the next exercise by definition. We will be most interested in the zonal 
spherical functions (the case a = 0). 


Exercise 3.2.12 (A Few More Properties of Associated Legendre Functions). 


(a) Show that if |z— 1| < 2 we have 
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where 


b= 1, ifk=0; 
k l s(s+1) e (s+k—1), ifk=1,2,.... 
and 2F; = the Gauss hypergeometric function (see Lebedev [401]). 
(b) Show that if a = 0,1,2, ..., then P@(z) = (2 — 1)*/?D*P,(z), if D = d /dz. 


(c) Show that P! (z2) = P-“(z)I'(s+a+t1)/T(s—a+t1) for a= 0,1,2, .... 
(d) Show that if 
Z 
E 


then 
P7 ° (a) = (Ê —1)-*71P,(2). 


Note that P(x) is a polynomial when both a and v are nonnegative integers; 
in fact, they are the functions that arose in the solution of the analogue of 
Exercise 3.2.11 for the sphere in Chap. 2. We will not have anything to say about 
Legendre functions of the second kind until Sect.3.7 (but see Hermann [292, 
Chap. 3]). Another reference for conical and spherical functions is Robin [543]. 


Exercise 3.2.13 (Laplace Transforms Relating K and P). Prove that the follow- 
ing integral formulas hold. 


(a) 


> u— T —u —u 
[T otot, aldi = f ET uT utv D- PM), 
if Re(u+v)>-—1, Re(u—v)>0, Rep>-l. 


(b) 


= PA 
| P _ı(u)exp(—ut)du = [EK if Ret >0. 
1 


Z 


Theorem 3.2.2 (Harmonic Analysis on H in Geodesic Polar Coordinates). Let 
(r,u) denote geodesic polar coordinates as in formula (3.23). And define 


Eas (kue "i) = exp(iau)P“, _, (coshr). 


3+it 


Then, if f € CZ (H) = the infinitely differentiable functions with compact support, 
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fo=— > (-1) bw Flat)e-ap(z) ttanh (7t) dt. 
where 


f(at) = TEORIO du. 


Proof. We show in Exercise 3.2.14 that it suffices to prove the inversion formula 
for the Mehler—Fock transform (see Mehler [462] and Fock [181]): 


g0) = | rtanh(a) Pap) f Pal) dwd, 820 


f(w) = wtanh ( nw) f P ijilu uf P byw) F(t) dt du. (8.27) 


There are discussions of this result in Exercises 3.2.15 and 3.2.17 below. We 
can also obtain Eqs. (3.26) and (3.27) from the asymptotics/functional equations 
principle which we used to discuss Theorem 3.2.1 and the Kontorovich-Lebedev 
inversion formula. For by Exercise 3.2.10, if |argx| < 7, then 


ae T(it) y) 3 tit I'(—it) 
)” Taran) ar-a) 


for fixed real t. This can also be proved by noting that the Legendre function is a 
Gauss hypergeometric function (see Lebedev [401]). Helgason [280, pp. 62-82], 
gives a discussion of the complete asymptotic expansion of the spherical function 
and its application to Fourier inversion on the symmetric space via Paley-Wiener 
theory. 

Our goal is to show that the kernel 


(2x) 727" as x — œ% 


R 
Vey) = f ttanh (m) P_} y (@)P yp 0)di 


approaches ô(x— y), as R — ©. As for the kernel (3.22), one can argue that the 
SL(2,R)-invariance of the problem means that it suffices to consider x, y ~ c. Now 
by a standard property of the gamma function (see Lebedev [401, p. 3]) we have 


rÐ) 1 


= . 2 
nr (4+it)r (4 —it) mt tanh(zr) aam 


It follows that 


Lf as 
we~ f x 2tty "2" dt for x, y~ o. 


176 3 The Poincaré Upper Half-Plane 


The right-hand side of the asymptotic relation is a Dirac delta family by Mellin 
inversion (see Exercise 1.4.1 of Sect. 1.4). Thus the spectral measure was chosen 
so as to cancel out the gamma factors in the asymptotic formula for the Legendre 
function, as was the case for the Kontorovich—Lebedev transform. 

The rest of the discussion of Theorem 3.2.2 is outlined in the next exercise. 


Exercise 3.2.14. Suppose that / and D are as in Exercise 3.2.12. 


(a) Use Exercise 3.2.12 and integration by parts to show that if f(k,e~"i) = 
g(x)h(u), with x = coshr, then 


fla, t) y= fo [Tw e” g w)(w? _ DPE ig dvdw 


27 
W i f h(v)e® P 1, ,(w)D* |g(w)(w? = 1)*] dvaw. 


(b) Use Exercise 3.2.12 to show that 


— Y(-1)4 F(a,t)€—ar(z)t tanh (zt) dt 
ag Nf, F00 Or tanh (ar) 

— ES (=ne f yf T poje” 
2m aeL teRt w=1/v=0 


xD" [Loo = o Py (wdw? —1)PIP_; , (x)ttanh (mr) dt. 


(c) Finish the proof of Theorem 3.2.2 by using the Mehler—Fock inversion formula 
and the fact that A(u) is represented by its Fourier series. 

(d) Show that €4(z) = €-a—+(z) = E€-a 1 (2). 

Exercise 3.2.15. Can you find a proof of the Mehler—Fock inversion formula 

which is analogous to the proof of the Kontorovich—Lebedev inversion formula in 

Exercise 2.2.8 of Sect. 2.2? 


Exercise 3.2.16 (Another Derivation of the Kontorovich—Lebedev Inversion 
Formula). 


(a) Use the integral formula in Exercise 3.2.1 to see that - (y) is the Fourier 
transform on R of hy(x) = 5 exp(—ycoshx); i.e., Ku (y) = hy(t/27). 

(b) Use the multiplication formula for the Fourier kaanon on R (see 
Theorem 1.2.1 of Sect. 1.2) to show that if f(x) is extended to R by setting 


f(x) = f(x), then 


2 üj 1 
axsinh (aa) f K0) f Si RAG u)Kiu(y) dudy 


1 j 1 
= —xsinh (rx) i. Kix(y)— 
y=0 


hee A 
2m2 a (>) exp(—ycoshu) dudy. 
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(c) Show that the Kontorovich—Lebedev inversion formula (3.19) follows from 
Fourier inversion on R (Sect. 1.2) and 


cos(xu) 


a 1 
—ycosh UK, dy= ; 
le x0) y 7 "sinh (1x) 


This last integral formula follows from part (a) of Exercise 3.2.13, since 


— (<1 z i)" (c+ #1) "*) 
z—1)4 


(see Erdélyi et al. [164, Vol. I, p. 150]). You also need formula (3.21). 


p? (z) = 


Exercise 3.2.17 (Another Derivation of the Mehler—Fock Inversion Formula). 
Use Exercise 3.2.13(a) to see that 


P Ayme iya t) dtd 
fee -jiw lu) tg (u8 (e) u 


= %7 aada oop Em. of Ki(v)ge(t)cosh (at) dt dudv. 
v 


Then use Exercise 3.2.13(b) and the Kontorovich-Lebedev inversion formula to 
prove the Mehler—Fock inversion formula. 


Exercise 3.2.18 (Some Mehler—Fock Transforms). 


(a) Show that if Re a > t z» then 


24-2 a+it—} a—it—} 
ap dw = T 2 |r 2 j. 
1 WEE ae) Ov T(a)/n ( 2 ( 2 


(b) Prove that if Rea > 0, Re b > 0, then 


x —v a? + b? + 2ab 2 
I exp|- va? + b? + 2a Ma sw) i= 
1 va? +b? + 2abw 2 mV ab 


There are many other examples of Mehler—Fock transforms in Erdélyi et al. [165, 
Vol. II, pp. 320-326]. Mathematica should be able to provide examples as well. 
See Vilenkin [704] and Wawrzyriczyk [722] for discussions of spherical functions 
attached to group representations. We will have more to say about this subject in 
Volume IT [667]. 
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3.2.4 The Helgason Transform on H 


Here we give our third and last version of non-Euclidean harmonic analysis on 
H, that of Helgason [276, 280]. Suppose that f € C7 (H). Define the Helgason 
transform of f for s € C, k € SO(2), by 


Uf(s,k) = f, f(Oim(R(O) y dady. (3.29) 


Set B = K/M, where M = {—1,]}. Then B is called the boundary of H. This is 
reasonable since B can be identified with the circle, which is just the one point 
compactification of the real line. Note that Hf(s,k) depends only on the coset kM 
of k in B. We will see in Lemma 3.7.3 of Sect.3.7 that the transform Eq. (3.29) is 
the same transform which number theorists call the Selberg transform, when f is 
K-invariant (see Kubota [375, p. 56]). 

The boundary of H has the non-Euclidean analogues of properties of the 
boundary of the unit disc. For example, it is possible to generalize classical potential 
theory in the framework of symmetric spaces (see Helgason [276], Koranyi [370], 
and Volume II [667]). Eigenfunctions of invariant differential operators on H can 
be shown to be given by a Poisson integral over the boundary (even in the higher- 
rank case) as was shown by Helgason (see also Kashiwara et al. [338]). The 
precise statement of this result requires Sato’s theory of analytic functionals or 
hyperfunctions. These are elements of the dual space A’ (B) to the space A(B) of 
analytic functions on the boundary. The space A(B) has a natural topology which is 
described for B = the circle as follows. Let U be an open annulus containing B and 
A(U) = the holomorphic functions on U topologized by uniform convergence on 
compact subsets. Identify A(B) with the union of all the A(U) (with the inductive 
limit topology). The eigenfunctions are then just these analytic functionals acting 
on the Poisson kernel, which is just the power function. 


Theorem 3.2.3 (Helgason’s Version of Harmonic Analysis on H). 


(1) Fourier Inversion on H. 
If f € CZ (H), then 


1 1 f? 1 ; 
f@= leant Hf (5 +i.4,) Im(kyz)2*" ttanh (mt) dudt, 
t u=! 


where k, € SO(2) is defined in formula (3.23). 
(2) The Paley—Wiener Theorem. 
The map f œ Hf takes C7 (H) one-to-one, onto the space of C” functions 
G(s,k) on C x SO(2) which are holomorphic in s and have properties (a), (b) 
below: 


(a) Gis of uniform exponential type R: 


sup Coase + isl)" IG(s,4)1) <œ foreach NE ZT. 
sEC,kESO(2) 
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(b) G satisfies the functional equation 
27 tas 1 
| Imatra (5+ ark du 


27 1 1 
= f Im(kuz)2? "G G — iku) du. 
u=0 2 


(3) The Plancherel Formula. 
The map f + Hf extends to an isometry mapping L’(H,du) onto L? (R x 


K, zizt tanh (7t) dt du), where K = SO(2) is identified with the interval (0, 271] 
using the map sending kų to u. 


Proof. We shall only discuss part (1). See Helgason [276, 280] for an alternate 
derivation of part (1) of the theorem, as well as proofs of parts (2) and (3). We will 


follow Helgason’s method for SL(n,R)/SO(n) in Volume II [667]. 
First let us relate the transform f(a,t) of Theorem 3.2.2 to Hf (4 +it,k). Set 


c(a,t) =T (5 +i+a) [zar (5 +#)| E 


Then, by the integral formula in Exercise 3.2.9, we have 
va 2n oy =F s ii . 
f(a,t) = I, fie "i)e™ P? japi (coshr) sinhr drdu 
an is =r ; „iau ii ia@ —r.\]1/2-it .. 
= c(a,t) f(k_ue “ie e [Im (koe i)] sinhr d0 drdu 
u=0 J r=0 0=0 


2m 2m poo , 
= c(a,t) 1 ent / f(k-ue™i)Im(kok-ue™"i)! 7" sinhrdrdud@. 
o=0 u=0 J r=0 
To obtain the last equality, we substituted ọ = u + 0. Thus 
. 27 1 , 
f(a,t) =c(a,t) | uf (5 +inke) “Pd. 
o=0 


By Theorem 3.2.2, Exercises 3.2.9 and 3.2.10, we have 


1 foo 2m 1 A 
Tj) = — —)% lap ee Be? —iau p—a 
f(k-ue i) In He 1) can f e ni (5 +itsk e PZ t „y ooh?) ttanh (zt) dọ dt 
L gay [7 eaea) 
= — — c(a —a,— 
4n del ae 


2m 1 A 2r : A 
x f e Hf G +isko) am e 9 Im(kge™"i) 2+ tanh (at) d8 dọ dt. 
o=0 @=0 
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It follows from I'(z)'(1 — z) = wesc (7z) that c(a,t)c(—a, —t) = (—1)"(2)~? (see 
Lebedev [401, p. 3]). Therefore 


2n 2n 
Jei =a 5 o t tanh (1) f 7 f _  oxPlia( pu — 8) 


acZ 
x (Im(kge~ "i)) zit ur(5 + it, ke) d0 dọ dt. 


Lis 
Now (Im(kọe™"i)) 7+", as a function of 0, is represented by its Fourier series, which 
implies 


ma 
Hf a tanh (nt) (Im(kpk-ye~i)) 2" "ur (žito) dtdg. 


This completes the proof of part (1) of Theorem 3.2.3. 


Helgason [276, pp. 9-10], gives a much more elegant reduction of the proof of 
Theorem 3.2.3, part (1), to the case of K = SO(2)-invariant functions f € CZ (H). 
The idea is to imitate the discussion of Fourier inversion on R” which was given in 
Sect. 1.2. Define the inverse transform S for functions h : R x B— C by 


1 27 , 
sias Í h(t ky) (Im(Kyz))2*# ttanh (mt) dt du. (3.30) 
87? ER Ju=0 
Then Helgason proves that 


| £ SHsdu= | (supe du, (3.31) 
H H 


and that SH commutes with the action of G = SL(2,R) on H. We shall discuss this 
proof in detail for G = SL(n,R) in Volume II [667]. 

Next we want to consider the Helgason transform of K= SO(2)-invariant 
functions. Suppose f(z) = f(kz), for all k in SO(2) and z in H. Then the Helgason 
transform of f is 


Als) = Hs (6.8) = Hs.) = | FOF? dx dy 
= =m | fle i)P_ Lait (coshr) sinhr dr, (3.32) 
ifs= 5 + it, and J = the 2 x 2 identity matrix. The inverse transform S for f(s) 
is a transform which takes functions F : R — C onto functions SF with domain H, 


defined by 


SF (kei) = = f F()P_1,,(coshr) rtanh (mt) dt (3.33) 
i 
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for k in SO(2) and r > 0. Then if f € CZ (H) and f(k(z)) = f(z) for all k € SO(2) 
and z € H, the Fourier inversion formula of Theorem 3.2.3 is 


fos. (3.34) 


This is nothing else but the Mehler—Fock inversion formula (3.26) and (3.27). Note 
that for rotation-invariant functions on H, the Helgason transform is essentially an 
inner product with the zonal spherical function P_1 | ;, (coshr). 

2 
Exercise 3.2.19 (Convolution). 


(a) Let f,g : SL(2,R) = G —> C. Define the convolution f * g by 


(F+8)(a)= | Oga) ab, 


where db = a right- and left- invariant Haar measure on G. Suppose that either f 
or g is really a function on K\G/K; i.e., it is a K-invariant function on H, where 
K =SO(2), as usual. Show that H(f*g) = Hf- Hg. Deduce that L! (K\G/K)) 
is a commutative algebra under convolution. 

(b) Show that you should not expect the convolution property to hold unless either 
f or g is K-invariant. 


Hint. In Volume II [667], we generalize this exercise to SL(n,R), the special linear 
group of n X n matrices of determinant 1. The proof may be slightly clearer in the 
general context. It uses a property of the power function. 


Exercise 3.2.20 (Writing the Geodesic Radial Coordinate r in Terms of the 
Rectangular Coordinates x,y). Let x+ iy = k,e~'i as in formula (3.23) above. 
Note that this means that for some v, we have 


u(i aia) erro” exe) 


Deduce that 


M'M=k, ae 
0 æ 


and that 


Tr(M 'M) =y+ (1+3?) /y =2coshr. 


The following exercise concerns a transform that will appear again as part of the 
Selberg trace formula in Sect. 3.7. 

We spoke of geodesics in the last section. Another kind of curve in H is of interest 
to us as well. This is the horocycle. The prime example is a horizontal line in H. Any 
image of such a line by an element of SL(2,R) is also a horocycle. Figure 3.11 shows 
horocycles through the origin. The horocycles are all orthogonal to the geodesics. 
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-1.0 -0.5 0.5 1.0 


Fig. 3.11 Some horocycles in H 


Exercise 3.2.21 (The Helgason Transform of K-Invariant Functions Is a Com- 
position of Harish and Mellin Transforms). 


(a) Define the Harish transform (also called Abel or horocycle transform) T f of 
f: H —> C by 
1 
=] f 
Jy xER 


See Lang [389, pp. 69 ff]. Let Mg denote the Mellin transform of g: Rt > Cas 
in Sect. 1.4. Show that the Helgason transform of a K-invariant function f, K = 
SO(2) G.e., f (kz) = f(z) for all k € K and z € H) is given by 


f(s) =MTf (s- 5) l 


T f(y) (x+ iy)dx. 


(b) Show that if g: Rt > C and 
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Table 3.1 Short table of Helgason transforms 
FO = fke) = fei) f(s), s=>+it 
eT/4,/3(4nT)-3/2 f° be AT ab es- DT 


cosh b—coshr 


(coshr)“, a>0 Jn2 r 


(afre 
exp(—acoshr), a>0 2(27/a)! *K,_1 (a) 


then 


—1 w? 
=< | Gl ya | aw. 
o= fe (v5) aw 


(c) Set f(x+iy) = g(coshr), when x + iy and r are related as in Exercise 3.2.20. 
Show that T f(y) = G((y++y~!)/2), where G is the transform of g defined in 
part (b) and T is defined in part (a). 

(d) Use (a) and (b) to deduce the inversion formula for the Helgason transform of a 
K-invariant function at z = i (see Helgason [280], pp. 79-82). 


Hint. Mellin inversion implies that 


o(? He) = zh flit)y" dt. 


Set t = e” and show that 


Econ — f wy 


27 JreR sinh(v) 


Then use part (b) to see that 


=f / 2 
= — 1 2 
F Lae +w*/2) dw 


sh(v/2) 
t) t sin(tv) —_——— dtdv 
~ An? ZA a da E sinhv 
Here we changed variables according to coshv = 1 + w*/2, dw = cosh(v/2)dv 
We give a very short table of Helgason transforms. See Table 3.1. 
Exercise 3.2.22. Check Table 3.1 which lists three Helgason transforms. 


Hint. For the first line of Table 3.1 use Exercise 3.2.21 to see that if f(s) = 
ess“ s= 5 +it, then 
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1 1 
F(coshr) = F la exp (- G +?) r) exp(irt) dt 


1 ( T? + 2) 
= ex . 
2V TT p 4T 


Also 


; -1 
fle" = — F'(coshr+ w?/2) dw 
27 JweR 


E =f F'(coshv) sinhvdv 


\/2(coshv — coshr) 


No one seems to have evaluated this integral beyond what is in Table 3.1 (a result 
that would be of interest for work on the central limit theorem on H and its 
applications). 

For the second line, see Exercise 3.2.18. 

For the third line, see Exercise 3.2.13. 


Exercise 3.2.23 (A Conductor That Is the Surface of Two Intersecting Spheres). 
(a) Toroidal coordinates (a,b,@) are defined as follows in terms of cylindrical 
coordinates (r,@,z) (see Lebedev [401, p. 222): 


sinha sinb 


e— —— z=¢—___—_ 
cosha — cosb’ cosha — cosb’ 


where c is a constant. Show that the surface b = constant is a sphere 


2 
-ccotb)? +P = (©) . 
RECO E sinb 


(b) Show that 
Af = fat fyt fe 
a sinha of P d sinha of 
~ ða \ cosha — cosb ða Ob \ cosha — cosb ðb 


1 Of 
` (cosha — cosb) sinha 0¢2" 


(c) Consider the Dirichlet problem on the domain in Fig.3.12. Here the z-axis 
passes through the center of the two spheres. Choose c equal to the radius of 
the circle of intersection. The two spheres are given by b = bı and b = bp in 
toroidal coordinates. Then the Dirichlet problem is to find u with 
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Fig. 3.12 A slice of two 
intersecting spheres 


Au = 0, bı <b <b, 
Ulp=b; = fji j=1,2 


Show that separation of variables leads to a solution which has the form 


= F> sinh ((b—b; )t) +F; sinh ((b2—b)t) 
sinh ((b2—b1 Jt) 


u=/ 2(cosha— cosb) | P_ 1 u (cosha)dt, 


where 


F;(t) 


(cosha) sinhada, 


) = ttanh( (mt) f P: i 
4/2 ais 2 


Hint. See Lebedev [401, pp. 227-230]. 


3.2.5 The Heat Equation on H in Rectangular Coordinates 


The problem is to find u = u(z,t) = the temperature at z in H and time ¢, if u(z,0) = 
f(z) is the initial heat distribution. That is, we want to solve the heat equation on H : 


= Au =y? (fat ga) ult in) | (3.35) 
u(z,0) = f(z). 
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Separation of variables leads us to consider elementary solutions of the form 
u(z,t) = Z(z)T (t), where 


Set k = —(v?+ 4) =s(s— 1), if s = 5 + iv. Then T(t) = exp|— (v? + 4)t] and 
Z(z) = ea s(z), as in Theorem 3.2.1. Then we can use the spectral decomposition of 
A in Theorem 3.2.1 to obtain 


1 
u(z,t) =< | f vsinh (mv) A(b, v)e, 1, (ze (+4) dvdb, (3.36) 
T? JbeR JveR bz tiv 


where 
Z ae” 
Alb») = | Sga dxdy. 


Then Exercises 3.2.24 and 3.2.25 below show that this solution can be rewritten in 
the form 


u(z,t) = f*G, 


G,(kew"i) = $ Jnew btanh(1b)P_1 y 


(coshr)e~ (+4)! db | (an) 
where convolution of functions on H is induced from convolution of functions 
on SL(2,R) as in Exercise 3.2.19. A function f : H > C induces a function f : 
SL(2,R) — C by writing f(g) = f (gi) for g in SL(2,R), i = v—1. The function G, 
in Eq. (3.37) is the fundamental solution of the heat equation or heat kernel and 
thus gives rise to the non-Euclidean analogue of the normal distribution discussed 
in Example 1.1.3 of Sect. 1.1 and Example 1.2.1 of Sect. 1.2, as well as in the 
discussion of the central limit theorem at the end of Sect. 1.2. We will obtain a 
more direct treatment of formula (3.37) using Helgason’s transform, after the next 
two exercises. Then we will develop the non-Euclidean central limit theorem for 
SO(2)-invariant random variables. 


Exercise 3.2.24. 


(a) Prove that if z = x + iy and w = u + iv in H, then we can write Eq. (3.36) as 


u(z,t) = [foe (z,w)v? du dv, 


where 


G;(z,w) = ve f exp (- (2+ z) r) rsinh (zr) 


x | exp|2mib(x — u)|Ki,(27|b|v)Ki,(2a|bly) dbdr. 
bER 
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(b) Show that 
E f er 
f vodite ion a Py (2 *) l 
b=0 


4Vab 2ab 
(c) Use (a) and (b) to prove 
1 
G;(z,w) = zS e (Otay btanh (zb) P, 
4T JbeR 
where r(z,w) is the geodesic radial coordinate [see formula (3.23)] of the point 
M7! M;i, if M;i = z and M,,i = w for M;, Mw in SL(2,R). In particular, we have 


(FBP 


—r/2 
= mn) k,k €K, 


= (coshr(z,w)) db, 


= 0 et/2 


y+ (1 + x”) /y =2coshr, as in Exercise 3.2.20. Then use Exercise 3.2.25 to see 
that 2coshr(z,w) = (y? +v? + (x—u)*)/(vy), if z and w are as in part (a). 
(d) Prove formula (3.37). 


Exercise 3.2.25. 
(a) Suppose that z = x + iy and use the same matrix M, as in part (c) of 


Exercise 3.2.24. Similarly, define M,, for w = u + iv. Show that 


M.M-'i= (=e) ETE E 
4 v v v 


(b) Then write M;!M;i = kue "i, where r = r(z,w), k € K = SO(2) are the 
geodesic polar coordinates for M; M;i. And show that 


2coshr(z,w) = y +v +(x u)’] /(vy). 


3.2.6 The Heat Equation on H Using Helgason’s Transform 


This section is modelled on the discussion in Gangolli [191, 192]. We want to 
consider the non-Euclidean analogue of the method that was used in Example 1.2.1 
of Sect. 1.2 to find the fundamental solution of the Euclidean heat equation. We 
shall assume that f(kz) = f(z) for all k € SO(2) and z in H. Then take the Helgason 
transform with respect to z of the PDE in Eq. (3.35). This gives 


Huy; =H (Au) . 


Integration by parts or Green’s theorem says that 
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d 
3, tH =s(s—1)Hu. 


Thus Hu = Hf -e*°—)", Using Exercise 3.2.19, it follows that 


u(z,t) = fS (ee), 


where S denotes the inverse transform Eq. (3.33): 


1 
= Fa ge OP san (cosh?) vtanh (7v) dv. 
This is formula (3.37). Moreover, Exercise 3.2.22 shows that the fundamen- 
tal solution of the heat equation or density for the non-Euclidean normal 
distribution is 


č —b? /4t 
G (ei) = -a al roA (3.38) 
Therefore 
G:(z)>0 forallt >Oandze dH. (3.39) 
Moreover, 
G,(z) 36 ast—> 0+, (3.40) 


where 6 = the Dirac delta distribution on H; i.e., f xô = f. To see Eq. (3.40) note 
that by the Plancherel theorem for H we have 


1 a x 2 
It*G—flB= =f |F*G—/Yo)| vanav) av. 
s=diiv 
veR 
Now 
FG =f G=feee' +f ast0+. 
Thus, 


lf*G -fli 30 ast 0+, 


by the Lebesgue dominated convergence theorem. 
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In fact, G; has the properties of the Gauss kernel, except that it does not appear 
to be possible to find a simpler formula for G, than Eq. (3.37) or Eq. (3.38). For- 
mula (3.38) was obtained by Karpelevich et al. [337], as well as the corresponding 
result for the three-dimensional analogue of H to be discussed in Volume II ( [667], 
where the fundamental solution of the heat equation on hyperbolic 3-space is 
considered). 


3.2.7 The Central Limit Theorem for K-Invariant Random 
Variables on H. Transmission Lines with Random 
Inhomogeneities 


There are many versions of the central limit theorem on symmetric spaces such as 
the upper half-plane. The first papers on the subject were written by mathematicians 
in the U.S.S.R. during the 1950s and 1960s (see Karpelevich et al. [337], Tutubalin 
[688], and Virtser [705]). Mathematicians in the U.S. discussed these matters using 
very general limit theorems on stochastic differential equations in the 1970s (see 
Burridge and Papanicolaou [73], Keller and Papanicolaou [349], and Papanicolaou 
[509]). Mathematicians in France obtained central limit theorems for various sorts 
of Lie groups (see Bougerol [58], Clerc and Roynette [92], and Faraut [173]). 
Limit theorems have also been obtained by Heyer [294] and Triméche [685]. A 
similar discussion of Euclidean rotation-invariant random variables is to be found in 
Kingman [352]. There are many potential applications for such limit theorems; e.g., 
in demography (see Cohen [96, especially p. 290]), learning theory (see LePage 
[412] and the references there), atomic physics (see Hantsch and von Waldenfels 
[248], as well as Hurt and Hermann [310]). Other related papers are Dudley [141, 
142], Furstenberg [188], Furstenberg and Kesten [189], Gangolli [193], Getoor 
[209], and Letac [413]. The last reference gives analogues of classical probability 
problems for various symmetric spaces, including the p-adic symmetric spaces. 

First, let us consider an engineering problem of Gertsenshtein and Vasil’ev 
[208]—a problem which requires the non-Euclidean central limit theorem for its 
solution. We wish to analyze a very long lossless transmission line with random 
inhomogeneities caused perhaps by tiny defects. Such inhomogeneities produce 
reflected waves, the properties of which are described by the reflection coefficient 
which we can view as a random variable Z in H, as we saw at the end of Sect. 3.1. 
The composite of two random inhomogeneities with reflection coefficients Z4 and 
Z in H produces a reflection coefficient 

Z| °Z2 = Mz,Mz,1, where Zj = Mz;i for Mz; E€ SL(2,R), j= 1,2. (3.41) 
This composition is only well defined when the Z; are SO(2)-invariant, which is 
the only case to be considered here. The question then arises as to the distribution 
of Z1 0---oZ,, when it is correctly normalized, as n — œ. Having found the limit 
distribution, one should be able to compute the mean power output, for example. 
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To carry out this project, we need a central limit theorem for H. Luckily we 
already have a candidate density for the non-Euclidean normal distribution, namely 
the fundamental solution of the heat equation on H given by Eqs. (3.37) and (3.38). 
We shall attempt to keep our discussion as close as possible to the discussion of the 
Euclidean central limit theorem (see Theorem 1.2.7 of Sect. 1.2). 

First, we should set down the requisite definitions. A random variable Z in H 
has distribution function 


P(ZEA) = | falz)y? dxdy = f fz du, 


where fz is the density function and 


eS, JAZO Jusi 


We shall consider only SO(2)-invariant random variables on H; this means that 
the density function must satisfy the invariance condition 


fz(kz) = fz(z) forall ze H,kEK. 


The sum or composition of two SO(2)-invariant random variables is defined by 
formula (3.41). 
We shall say that the random variables Z; and Z, are independent if 


P(Z,; € A and Z) € A) = P(Z; € A)P(Z2 E€ A). 


Exercise 3.2.26. Show that if Z; and Z? are SO(2)-invariant independent random 
variables in H with density functions fı and f2, respectively, then the density 
function for Z; o Z is fı * f2, with convolution defined as in Exercise 3.2.19. 


Hint. You can imitate the proof that works in the Euclidean case, since 


PZomeay= | f ADAE) dul) dule) with fy = fz; 


Mz, Mz, icA 


Set w = Mz, Mz, i, and note that 


f znw duw =f filer) f_, Fultz’) duw) duli): 


V 


The characteristic function @z of an SO(2)-invariant random variable Z in H is 
the Helgason or non-Euclidean Fourier transform from Eq. (3.29): 


exp) = | fele) ma dul), s= stip, peR. (3.42) 
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These non-Euclidean characteristic functions possess many (but unfortunately 
not all) of the properties of Euclidean characteristic functions described in 
Theorem 1.2.6 of Sect. 1.2. 


Exercise 3.2.27 (Properties of Characteristic Functions). Which properties of 
Euclidean characteristic functions in Theorem 1.2.6 of Sect.1.2 fail for non- 
Euclidean characteristic functions? 


Hint. The main problem lies with part (2) of Theorem 1.2.6 of Sect. 1.2. 


In the preceding exercise it was noted that @z lacks one important property of 
the Euclidean characteristic function. For @z has domain R or C and not H itself. 
This means, for example, that we cannot say that if a € R, then @,z(p) = @z(ap) for 
p € R*. This property was quite important in our proof of the central limit theorem 
(Theorem 1.2.7 of Sect. 1.2). And the lack of this property seems to be the cause 
of some non-Euclidean trouble. Clerc and Roynette [92] meet this issue head on 
by considering random variables in the domain of the characteristic function rather 
than H. On the other hand, Karpelevich et al. [337] do not mention this problem 
and only sketch the beginnings of a proof of their central limit theorem. We shall 
combine ideas from both these papers. 

The density function for the non-Euclidean normal distribution is defined 
to be G,(z), the fundamental solution of the non-Euclidean heat equation given by 
formulas (3.37) and (3.38). Then if Ne is a normally distributed random variable in 
H, the characteristic function is 


Qn. (p) = exp |- G + ‘| 


In the non-Euclidean case, there are many possible analogues of the mean and 
the variance. We choose the most direct analogues and define for an SO(2)-invariant 
random variable Z in H, the mean mz and the variance or, better perhaps, the 
second moment dz by 


mz = an | fz(e i) rsinhrdr, 
r>0 


(3.43) 
dz=2n Í fz(e™"i) r” sinhrdr. 
r>0 


Before proceeding with our discussion of the non-Euclidean central limit theo- 
rem, we need another asymptotic property of Legendre functions. 


Exercise 3.2.28 (An Asymptotic Relation Between Legendre Functions and 
J-Bessel Functions). 


(a) Show that 


v2 f’ cos(pu) du 
m Jo ./coshr—coshu 


P_1,;,(coshr) = 
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(b) Show that Pins (coshr) ~ Jo(pr), as r > 0. 


The formula in part (b) of the preceding exercise is a special case of a very 
general phenomenon relating the spherical function on G/K to the spherical 
function on the tangent space to G/K at a point. We will consider the analogue 
for SL(n,R) in Volume II [667]. 

The complete asymptotic expansion of P_ 1 ip (Cosh r), as r— 0, can be found in 
Szegé [651] or Fock [181]. This has been generalized to rank 1 symmetric spaces 
by Stanton and Tomas [622]. Clerc and Roynette [92] use a similar result, which is 
analogous to 


A 
PA ipm (cosh 2) ~Jo(pr) asm —> ee. 


In the classical case, the mean and standard deviation of a random variable are 
(essentially) the first and second derivatives of the characteristic function evaluated 
at 0. However, in the non-Euclidean case, this produces different integrals. 


Exercise 3.2.29 (Other Analogues of the Mean and the Variance). 


(a) Show that if Z is an SO(2)-invariant random variable in H, then 


(0) =0. 


This can be viewed as a non-Euclidean analogue of the mean, which differs 
from that in Eq. (3.43). 
(b) Define the dispersion Dz of an SO(2)-invariant random variable in H by 


u? fz(e~"i) sinhr 
pe= 97 (0) _ lem coshr — cosh u 
ọz(0) fz(e™"i)sinhr 


coshr — cosh u 


dudr 


dudr 


Hint. Use Exercise 3.2.28(a). 


Note that Dz can be viewed as a non-Euclidean analogue of the variance, which 
differs from that in Eq. (3.43). 

(c) Show that if Z, and Z are independent SO(2)-invariant random variables in H, 
then 


Dz oz = Dz, + Dz. 


(d) Show that Dy, = 2c, if N, has the non-Euclidean Gaussian or normal distribu- 
tion; i.e., has density given by the heat kernel Ge for H. 


Exercise 3.2.30. Graph the heat kernel G,(e~"i) for various values of c. Com- 
pare with Fig.3.13. Then compute the mean and second moment defined by 
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+ 
0.5 1.0 1.5 2.0 2.5 3.0 


Fig. 3.13 Non-Euclidean heat kernel G,(e~"i) for c = 1/n, n=1,2,...,10 


formula (3.43) for these same values of c. Show that the mean approaches infinity 
as c approaches infinity. This can be derived from the following formula of 
Gertsenshtein and Vasil’ev [208]: 


f G,(e"i)rdr = e”. 
1 


To discuss the non-Euclidean central limit theorem, we suppose that we are given 
a sequence {Zn }n>1 of independent, SO(2)-invariant, random variables in H, each 
having the same density function f. We want to find some way to normalize the 
random variable 


Sn =Z oo Zn (3.44) 


in order to be able to say that the normalized variable, which we shall call S#, 
approaches the random variable with density Ge = the fundamental solution of 
the non-Euclidean heat equation given by formulas (3.37) or (3.38) above. Now 
Karpelevich et al. [337] normalize S,, by noting for A C H, 


P(S, € A) =20 | fs, (ei) sinh rdr. (3.45) 


eic€A,r>0 


Thus we really have densities on Rt, using the non-Euclidean radial variable (r = 
non-Euclidean distance of z € H to i= /—1). We want to normalize S,, by dividing 
the radial coordinate random variable by y/n. Thus the characteristic function of 
the normalized random variable S* is 


pxg(p) = 2x | _ file Tisinhr Py ,,(cosh(r/Vm)) dr, 846) 
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In the classical case, one could easily move the n—'/? over to the p-variable, since 


if X is arandom variable in R, 
Oxival) = |, x) explipx/-Vn)dx = øx(p/ vn). 


But the relation between P_| /9, jp/.yq(coshr) and P_1/24ip(cosh(r//n)) appears to 
be rather bizarre at first sight. 
However, we can use Exercise 3.2.28(b) to rectify this situation and find that 


2g ok rp . 
~ (27 e'i) sinhr Jọ | —= )dr as n — o, 
Ps (p) ( esa ) 0 (=) ) 


And we have the power series for the J-Bessel function: 


10) = 3 Ge G3) 


k>0 


This implies that 


2 n 
Ps (P) ~ (2x fle"ijsinhrdr- Zp | fled)? snhr dr) 


r>0 4n r>0 
apy” 

red eee a e Plt asn o, 
4n 


as in formula (3.43). 

In order to complete the discussion of the central limit theorem for rotation- 
invariant random variables on H, one must imitate the argument given in the proof 
of the Euclidean central limit theorem (Theorem 1.2.7 of Sect. 1.2). We have proved 
that 


lim @s;(p) = exp(—dp* /4). (8.47) 
Note that the density function for the random variable S* is 


ft (kei) = Jal f *---* f)(e7V8) sinh(ry/n) / sinh, (3.48) 


where k € SO(2) and r > 0. Next let œ € CZ (H). And set 


1 
do(p)= ga? tanh (ap) dp. 
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By formula (3.47), the Plancherel theorem (Theorem 3.2.3), and the Lebesgue 
dominated convergence theorem, we have 


n-eco 


lim in (2) du = lim f Uf (3+) Ha G +ip) do(p) 


= | exp(—dp?/4)He (5 +i) do(p) (3.49) 


=h f Gada du. 


We can approximate the indicator function of a measurable set in H by œ in C? (H) 
to complete the proof of the following theorem. 


Theorem 3.2.4 (A Non-Euclidean Central Limit Theorem for Rotation- 
Invariant Random Variables). Suppose that {Zn}n>1 is a sequence of 
independent, SO(2)-invariant random variables in H, each having the same density 
function f(z). Let Sn = Zi © - -o Zn be normalized as in formulas (3.46) and (3.48) 
above. Suppose that f# is the density function for the normalized random variable 
SË. Then for measurable sets A C H we have 


fdu weth f Gadu as n > œ. 


Here G, denotes the fundamental solution of the non-Euclidean heat equation or 
density for the normal distribution given by formulas (3.37) and (3.38). And d is 
defined by formula (3.43). 


Exercise 3.2.31. 


(a) Prove that |P,(coshr)| < 1, if —1 < Re s <0. 

(b) Use (a) to show that if @z is the characteristic function of an SO(2)-invariant 
random variable Z with density function f(z), then |@z(p)| < 1 forall p € R. 

(c) Justify formula (3.49). 


Finally, let us return to the discussion of the lossless transmission line with 
random inhomogeneities. Exercise 3.2.30 showed that the mean distance from i to a 
normally distributed random variable with density G,(z) increases exponentially as 
c approaches infinity. And we can conclude from parts (c) and (d) of Exercise 3.2.29 
that c approaches infinity as the length of the transmission line increases. Recalling 
the discussion in Sect. 3.1 which showed that the transmitted power decreases as the 
distance of the reflection coefficient from the origin increases (measuring distance 
using the non-Euclidean metric), we conclude that a long transmission line reflects 
almost all of the incoming power. More precise calculations might allow an engineer 
to do something about this (see Feller [177, Vol. II, pp. 208—209]). 

There are higher rank versions of the central limit theorem. These will be 
discussed in Volume II [667]. The reader should compare the discussion of the 
central limit theorem for P3 in Vol. II [667, pp. 109-110] with the preceding 
discussion. See also Richards [541]. 
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3.2.8 Some Remarks on the Uses in Theoretical Physics 
of Harmonic Analysis on Groups Such as SL(2,R) 


We shall keep our remarks very brief. Detailed discussions of the representations 
of SL(2,R) can be found in Lang [389], Sugiura [648], and Vilenkin [704]. 
Applications of representations of noncompact groups like SL(2,R) to physics 
are considered, for example, in Barut and Ragzka [28], Mackey [442-444], and 
Wybourne [746]. 

Physicists are interested in representations of groups such as SO(3,1) and 
SO(4,2) which leave invariant various PDEs describing, for example, electromag- 
netic waves or elementary particles. The representations of groups such as SO(p,q) 
have seen much attention. These are the real n = p +q by n matrices which leave 
invariant the quadratic form A +e +x, E i Te ioe x2, We saw in Exercise 3.1.13 
of Sect.3.1 that SL(2,R) and SO(2,1) are closely related. In fact, they have the 
same Lie algebra. See Borel and Wallach [55], Knapp [356], or Vogan [707] for 
more information on representations of Lie groups such as SL(n,R) or GL(n, R) or 
SO(p,q): 

The Fourier transform of a function on G = SL(2,R) involves various series of 
representations of G such as the principal continuous series and the discrete series 
(see Michelle Vergne [697] or Lang [389]). And Fourier analysis on SL(2,R) 
involves a mixture of Fourier series and integrals. The inversion formula goes back 
to Bargmann [25] and Harish-Chandra [252]. We shall not state it here. Instead, let 
us just say a few words about the physical relevance of such expansions. A good 
introduction to this subject is Chap. 18 of Wybourne [746], where it is shown that 
many physics problems lead to the Lie algebra of SL(2,R). In particular, one can 
find the Lie algebra of SL(2,R) or equivalently, SU (1,1), in the ODE: 


F'O) +40) f(y) =0,  q(y) = ay? +by’ +c. 


In fact, this sort of argument allows one to being SO(2, 1) into the discussion of the 
hydrogen atom. Then one can bring in SO(4) and SO(4,2) = SU (2,2)/ {1I}. In 
this way more accurate information on the spectrum of hydrogen has been obtained. 

Clearly, solutions of non-Euclidean wave equations and their properties such as 
the truth or falsity of the Huygens’ principle are of interest in general relativity (see 
Friedlander [186] and Helgason [276]). 


3.3 Fundamental Domains for Discrete Subgroups 
T of G = SL(2,R) 


A two-dimensional smooth orientable, but not compact space of constant negative curvature 
with the topology of a torus [or a sphere] is investigated. It contains an open end; i.e., an 
exceptional point at infinite distance, through which a particle or wave can enter or leave, 
as in the exponential horn of certain antennas or loud-speakers. 


—From Gutzwiller [239, p. 341]. See Exercise 3.3.14 for more details. 
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—1/2< Rez < 1/2, 
D=<zinH lz] 1, 
and |z| = 1 implies Re s > 0 


-1 —1/2 0 1/2 1 


Fig. 3.14 A non-Euclidean triangle D through the points p = e?%/3, p +1, ico, which is a 
fundamental domain for H mod SL(2, Z). The domain D is shaded. Arrows show boundary identi- 
fications by the fractional linear transformations from S and T which generate SL(2,Z)/ {+1} 


3.3.1 Introduction 


We saw in Sect. 3.2 that it was quite useful to study R/Z, which we think of as the 
circle or the interval [0,1] with 0 and 1 identified. And in Sect. 3.4, we saw that it is 
valuable to study crystals M(3,R)/T, where M(3,R) is the Euclidean group and F 
is a discrete subgroup of M(3,R). Similarly, there are higher-dimensional crystals 
M(n,R)/T, with T any discrete subgroup of the n-dimensional Euclidean motion 
group M(n,R), which is a semidirect product of O(n) and the translation group 
R”. Bieberbach proved in 1910 that M(n,R) has only a finite number of different 
subgroups with compact fundamental domains. This answered part of the 18th of 
Hilbert’s famous problems (see Milnor [468, pp. 491-497]). 

In this section, we shall consider discrete subgroups T of SL(2,R). It was 
proved by Poincaré that, unlike R”, the upper half-plane has an infinite number 
of essentially different kinds of compact fundamental domains I’\H. Thus Hilbert 
called these domains Poincaré polygons. We shall be most interested, however, 
in the noncompact fundamental domain for the modular group T = SL(2,Z) of 
2 x 2 integer matrices of determinant 1. A fundamental domain for this group is 
pictured in Fig. 3.14. The main goal of this chapter is to study harmonic analysis on 
SL(2,Z)\H (see Sect. 3.7). In Volume II [667] we aim to discuss the analogue for 
SL(n,Z) and other more complicated discrete groups. 

General references for this section are Apostol [12], Ford [183], Gunning [236], 
Hecke [258,259], Svetlana Katok [342], Klein and Fricke [355], Knopp [357], 
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Fig. 3.15 The Coxeter illustration that inspired Escher. (From Coxeter [114, p. 285]. Reprinted 
by permission of John Wiley) 


Koblitz [359], Lang [391], Lehner [410], Maass [437], Rankin [532], Schoeneberg 
[562], Serre [576], Shimura [589], and Siegel [596]. 

Just as one can tile the Euclidean plane with L-translates of fundamental 
domains R? /L for two-dimensional lattices L, one can tile H with T-translates of 
a fundamental domain I'\H. This produces what is called a non-Euclidean tiling or 
tessellation of H and an interesting picture, sometimes referred to as non-Euclidean 
wallpaper. In 1958, the artist M. C. Escher saw such a picture in a book of H. S. M. 
Coxeter (Fig. 3.15) and was inspired to create his various circle limit pictures (see 
Ernst [166, p. 108]). In Escher’s circle limit pictures, the upper half-plane is replaced 
by the unit disc, using the Cayley transform, from formula (3.10) of Sect.3.1. 
Figure 3.16 is the tessellation of H for I = SL(2,Z). It was drawn by computer 
(as directed by Mark Eggert). In Volume II [667], we will find some beautiful 
three-dimensional analogues of this tessellation, also drawn by computer with Mr. 
Eggert’s direction. You can find other two-dimensional tessellations in Ford [183, 
pp. 305-309], Hurwitz and Courant [312, pp. 430-445], Klein and Fricke [355], 
Lehner [410, pp. 11-17], and Nehari [492, pp. 308-316]. See also the websites of 
David Joyce 


http://alephO.clarku.edu/~djoyce/poincare/poincare.html, 
Helena Verrill 


https://www.math.|su.edu/~verrill/fundomain/, 
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and Gerard Westendorp 
http://westy3 1.home.xs4all.nl/Geometry/Geometry.html. 


Why study harmonic analysis on T\H ~IT\G/K, G = SL(2,R), K = SO(2), 
and the analogue for other Lie groups? One justification comes from the following 
dialogue which appeared in 1953 in Scientific American (see Le Corbeiller [403]): 


“In the last 60 years, however, a new revolution has taken place, and everywhere we look 
we find that what seems to be continuous is really composed of atoms. . ..” 


“But are not modern mathematicians interested in such things?” asked Empeiros [a 
physicist]. 


“They are,” I answered, “but they give them other names. They call them Number Theory 
and the Theory of Discontinuous Groups. Actually they have found much more than we 
can use as yet in physics, but we have in crystals illustrations of some of their simpler 
theorems. . ..” 


Functions on I\H (or equivalently, on the fundamental domain with boundaries 
identified) are called automorphic functions. Surprisingly, every analytic func- 
tion f(z) can be considered as an automorphic function. For f lives on a Riemann 
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surface § which has a (simply connected) universal covering surface Ŝ. The Riemann 
mapping theorem identifies Ô with either the Riemann sphere, the complex plane or 
H. The group of isometries of S contains a subgroup I isomorphic to the group of 
covering transformations of S over S. Thus f is automorphic with respect to I. For 
the details of this argument which we first saw in a course by Joseph Lehner, see 
Siegel [596, Vol. I, Chap. 2], or Hurwitz and Courant [312, pp. 453-550]. 

Historically, the study of automorphic functions arose with the attempt to solve 
various problems arising in applied mathematics, as well as number theory. For 
example, the problem of heat diffusion leads to theta functions (see Exercise 1.3.9 
of Sect. 1.3). The search for a proof of the prime number theorem, giving the 
asymptotic behavior of the number of primes less than or equal to x as x > œ, 
leads to the study of Riemann’s zeta function. The latter function is connected with 
the theta function by Mellin transform (see Exercise 1.4.7 of Sect. 1.4). The simplest 
theta function is 


O(z) = J exp(min?z) forz€ H. (3.50) 
neZ 
Theta is not quite an automorphic function. It is what is known as an automorphic 
form, since theta has the following transformation properties: 


0(z+2) = 0(z), 
@(—1/z) = (z/i)/7 (2). 


The last equation results from Poisson summation (see Exercise 1.3.8 of Sect. 1.3 
and Exercise 1.4.6 of Sect. 1.4). The two maps z > z + 2 and z+» —1/z generate 
a discrete subgroup of SL(2,R)/{+J} called the theta group To. Also, 0(2z) is a 
modular form of weight 5 for the congruence group I'9(4) (see Pfetzer [513] and 
Serre and Stark [581]). The congruence subgroup T(N) of SL(2,Z) is defined by 


non- {(2) 


Many problems of mathematics and physics lead to elliptic integrals, for 
example the lemniscate integral: 


w= f(z) = fa- dz. 


The Dirichlet problem for various domains in the plane can often by solved via 
such integrals, thanks to the Schwarz—Christoffel transformation (see Carrier et al. 
[76, Chap. 4]). It turns out that the inverse function z = g(w) is doubly periodic (see 
Ahlfors [3, p. 232]). The theory of such functions leads one quickly to automorphic 
functions and forms. In fact, one is often able to compute elliptic integrals via theta 
functions (see Sect. 3.4). Other references for these things are Lehner [410], Nehari 
[492, pp. 280-296, 308-316], and Siegel [596, Vols. I, II]. 


a,b,c,d in Z, ad— bc = 1, c= o(moan)}. 
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Fig. 3.16 Tessellation of H for SL(2,Z). (Computer drawing by the UCSD VAX and Mark Eggert) 
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The J-function is another example of an automorphic or modular function that 
can be viewed as giving a certain conformal mapping. It can also be used to prove 
Picard’s theorem that an entire function takes on every finite value with one 
possible exception (see Sect. 3.4). The harder Picard theorem says that a function 
holomorphic in a punctured disc with an essential singularity at the center has the 
property that there is at most one finite number a such that the function takes on 
the value a only finitely many times. There is also a way of dealing with the harder 
result via automorphic forms (see Lehner [411]). 

There are many applications of automorphic forms to algebra and number theory. 
The computation of a fundamental domain for SL(2,Z)\H yields an algorithm 
for the computation of class numbers of imaginary quadratic fields (see 
Exercise 3.3.5, which follows). This is just the classical correspondence between 
ideal classes of imaginary quadratic fields and reduced integral positive binary 
quadratic forms (see Borevitch and Shafarevitch [56, pp. 149 ff], or Davenport 
[120, p. 195 ff]). The same fundamental domain can also be considered a funda- 
mental domain for the reduction theory of positive definite quadratic forms of 
determinant 1 by Exercise 3.1.9 of Sect. 3.1 and Siegel [597, pp. 68-74]. 

Hermite, Kronecker, and Brioschi used automorphic functions to solve the 
general algebraic equation of degree 5 (see Lehner [410, p. 10]). Automorphic 
functions are also necessary for explicit versions of class field theory—the study 
of normal extensions of algebraic number fields with abelian Galois group. The J 
function is useful here, since it has the property that J(a) is an algebraic integer 
when a lies in the upper half-plane and an imaginary quadratic field. More details 
on this subject can be found in Borel and Chowla [53]. Hilbert’s problem 12 asks 
for functions whose special values generate abelian extensions of number fields, 
just as values of the exponential function generate abelian extensions of the field 
Q of rational numbers. The answer for imaginary quadratic base fields requires 
automorphic functions and was found in the late 1800s and early 1900s by Weber, 
Kronecker, Fueter, and others. Much of the work on the subject is due to G. Shimura 
(see [589]). See also Stark [626, 627]. 

Recently automorphic forms have appeared in the proof of Fermat’s Last 
Theorem by Wiles and Taylor. See Mozzochi [480] for a popular account of the 
proof. We will say more about this later as well as many more number theoretic 
applications of automorphic forms. 

The study of the representations of SL(2, R ) also leads to classical automorphic 
forms in the case of discrete series representations (see Gelfand et al. [203, pp. 43- 
48]). So far the only functions on H that we have mentioned in this introduction 
are complex analytic or meromorphic functions of z in H (see Sect. 3.4). However, 
there are other interesting automorphic functions. Their study is more recent. 
They correspond to the continuous series representations of SL(2,IR) and they 
are eigenfunctions of the non-Euclidean Laplacian invariant under SL(2,Z). Such 
functions were first studied systematically in 1949 by Maass [439]. We will consider 
these Maass waveforms in Sect.3.5. They are essential for harmonic analysis on 
T\Æ to be discussed in Sect. 3.7. 
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Automorphic forms and harmonic analysis on fundamental domains of discrete 
subgroups of Lie groups have appeared in many papers in physics journals (see 
Bogomolny [46], Bolte and Steiner [47], Gutzwiller [239-241] and Hurt [308,309]). 
For example, Monastyrsky and Perelomov [474] consider automorphic forms on 
general symmetric spaces and give several references to applications, one of which 
concerns the problem of boson pair creation in alternating external fields. These 
applications are basically a reflection of the usefulness of group representations in 
physics as we mentioned at the end of Sect. 3.2, plus the fact that automorphic forms 
are involved in the description of various series of representations of the Lie groups 
like SL(2,IR). The terminology used in the quantum mechanics work is that the 
automorphic forms appear in systems of coherent states. 


3.3.2 Fundamental Domain for SL(2,Z) 


After this, perhaps overlong, introduction to our subject, let us get down to the study 
of T\H, for discrete subgroups T of SL(2,R). For concreteness, we shall mostly 
restrict our study to the modular group I = SL(2,Z). A fundamental domain D for 
T is a nice subset D of H which behaves like the quotient space '\H (at least up 
to boundary identifications). Thus for every z in H, there is a yin T with yz in D. 
Moreover, if z and w lie in the interior of D and z = yw, then y is either the identity 
matrix J or —I. We shall say that z and yz are equivalent under T if y € T. As in 
Sect. 3.1, the action of y € T on z in H is by fractional linear transformation. The 
example to keep in mind is the interval [0, 1) as a fundamental domain for R/Z. 


Exercise 3.3.1 (A Fundamental Domain for SL(2, Z)). Show that a fundamental 
domain for SL(2,Z) can be taken as the region pictured in Fig. 3.14; that is, 


1 1 
p={zeH | -7 <Rez< 5, |z| > 1, andif |z| = 1, then Rez>0}. 


Hints. 


(1) The Highest Point Method. Suppose z € H. You must find y in T = SL(2,Z) 
so that yz € D. Clearly you can translate z to make Re (yz) € (-3, +4]. Next 
note that Im(yz) = y|cz+d|7?, if yz = (az+ b) /(cz+ d), with a,b,c,d € Z and 
ad — bc = 1. Now you can choose y so that |cz + d| is minimal; i.e., Im(yz) is 
maximal, so that yz is a “highest point” equivalent to z. Why? To prove that |z| > 
1, use the existence of Sz = — 1 /z to obtain a contradiction to the maximality of 
Im(yz) otherwise. 

Next you must prove that z, yz € D for y € T, implies that y = +/. This can 
be done by brute force. Suppose yz = (az + b)/(cz + d). Then 


lez+ dl? = c?@ + cd(z+2) +d? > (|e|— ld}? + led]. 


Therefore c 4 0 implies that |cz + d| > 1. It follows that Im z = Im yz if c £0, 
since you can also consider y~!. Then (|c| — |d|)? + |cd| = 1 and only a few 
possible matrices y need to be considered: S, STS, ST~'S, with S, T as in 
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Fig. 3.14. It is easy to see what these matrices do to the fundamental domain 
(see Fig. 3.17). The case c = 0 is even easier. 

(2) The Generators and Relations Method of Poincaré. If you compute the 
tessellation of H by SL(2,Z) as in Fig. 3.17, you may notice, as Poincaré did, 
that the fundamental domain gives generators and defining relations of the 
projective linear group PSL(2,Z) = SL(2,Z)/{+/,—J}. The generators are 
the maps identifying the sides of the fundamental region. To have pairs of sides 
you need four sides, and not three. So cut the bottom circle in half at the point 
i= \/—I. Then the maps identifying the sides of D in pairs are Tz = z+ 1 and 
Sz = —1/z. Here we write T and S to denote the fractional linear transformations 
corresponding to the matrices T and S. This notation is easily confused with 
complex conjugate and I will usually try to avoid it. 

The defining relations for PSL(2,Z) are obtained by picture. To do this, 
follow the mappings that circle the two vertices p = (—1 + /—3)/2 and 
i= /—1. Since these mappings must lead to the identity when composed, you 
find that (ST)? = T when you follow the copies of D around the vertex p and 


you find that F =I when you follow the copies of D around the point i. 


Further Notes on the Preceding Exercise. Poincaré showed that one can start 
with a geodesic polygon P with certain properties and obtain a discrete group I 
for which the polygon is a fundamental domain. The sides of P must be geodesics 
arranged in pairs equivalent under certain fractional linear maps and equal in non- 
Euclidean length. The sums of angles at equivalent vertices in the fundamental 
domain which are fixed by order k elliptic fractional linear transformations must 
be 27/k. See the end of this section for the definition of “elliptic.” Thus, in the 
case of the modular group, the sum of the angles of the vertices of the fundamental 
domain D at p and p + 1 is 27/3 and 3 is just the order of the fractional linear map 
fixing p. And the sum of the angles at i is just 7 = 27/2, and 2 is again the order of 
the fractional linear map fixing i. In order to complete the preceding exercise using 
Poincaré’s ideas you must show that the generators and relations theorem actually 
gives an algorithm for finding fundamental domains. 

Stark [628] has shown how to generalize these considerations to higher- 
dimensional cases in order to obtain analogues of Poincaré’s results and to compute 
the fundamental domains. Other references for Poincaré’s theory are Beardon [31], 
Svetlana Katok [342], Lehner [410], and Poincaré [517]. 


Exercise 3.3.2. 


(a) Use the Euclidean algorithm to show that PSL(2,Z) = SL(2,Z)/{+1,—J} is 
generated by Sz = — 1 /z and Tz =z + 1. 


ROONG 


Hint. Note that 
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I = STSTST 


STSTS =T 


Fig. 3.17 Poincare’s method of generators and relations illustrated for SL(2,Z)/{+J,-I}. Sz = 

1/z, Tz=z+ 1; p = (—14+ V—3)/2; i = V—T. Generators of SL(2,Z)/{+1,—I} are S and T. 
Defining relations are $? = and (ST)? = I. Here we write T and S to denote the fractional linear 
transformations corresponding to the matrices T and S. The bars here do not refer to complex 
conjugate 


You can choose t € Z to make 0 < a! < c by the Euclidean algorithm. Then you 
can use S to flip a and c. Finally, you can obtain c = 0 in a finite number of steps, 
since a and c are relatively prime. 


(b) Show that the algorithm in (a) also gives the continued fraction expansion of 
a/c (see Hardy and Wright [251]). 


Exercise 3.3.3. Which of Figs. 3.18, 3.19, or 3.20 are tessellations of the unit disc 
coming from the modular group after mapping H to the unit disc under the Cayley 
transform z +> i(z — i)/(z + i)? Why? In Figs. 3.18 and 3.19, a shaded and a light 
region together make up a fundamental domain. 


Exercise 3.3.4. The fundamental domain for SL(2,Z) in Fig.3.14 is a non- 
Euclidean triangle with angles 2/3 at p = (—1+./—3)/2 and p + 1, and angle 0 
at co, Show that the non-Euclidean area of the fundamental domain for SL(2, Z) is 
1/3. Does this agree with Exercise 3.1.7 of Sect. 3.1? 


The fundamental domain for SL(2,Z) was used by Gauss in his research on 
quadratic forms. Instead of quadratic forms, one can consider ideal classes in 
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Fig. 3.18 Tessellation of the unit disc. (From Klein and Fricke [355]. Reprinted by permission of 
Teubner) 


Fig. 3.19 Another tessellation of the unit disc. (From Klein and Fricke [355]. Reprinted by 
permission of Teubner) 
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Fig. 3.20 Yet another 

tessellation of the unit disc. 
(Drawn by the USCD VAX 
computer and Mark Eggert) 


imaginary quadratic number fields as we do in the next subsection. We will consider 
Minkowski’s construction of a fundamental domain in SP, for GL(n,Z) in Volume 
II [667] as well as the analogue for more general discrete groups. 


3.3.3 Computation of Class Numbers of Imaginary Quadratic 
Fields 


Suppose that K = Q(vd ), d < 0, with d = the discriminant of K. There is a one-to- 
one correspondence between ideal classes C € Ig (as defined in Sect. 1.4) and points 
z in the fundamental domain D for SL(2, Z) in Fig. 3.14. For we can choose an ideal 
X in C of the form 
A=ZOLZz, z=(—b+vd)/2a, 
with 
d=b’—4ac, a>0, gcd(a,b,c)=1. 

Here a,b,c are in Z. Thus for z = (—b + Vd) /2a to be in the fundamental domain, 


we need —a < b < a and 1 < |z|? = c/a. It follows from the definition of the 
fundamental domain that the class number of K is 


h =# (abo) EZ? 


gcd(a,b,c)=1,a>0, -a<b<a,c>a, 
c>aifb<0, d=b*—4ac 
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Table 3.2 Class numbers 


A à i Class Class 
of imaginary quadratic fields Discriminant number Discriminant number 
— 1,000, 003 105 —1,000,052 306 
—1,000,007 630 —1,000,055 828 
—1,000,011 368 —1,000,056 364 
—1,000,015 430 —1,000,059 240 
—1,000,019 342 —1,000,063 394 
—1,000,020 320 —1,000,067 318 
—1,000,023 706 —1,000,068 372 
—1,000,024 274 —1,000,072 264 
—1,000,027 168 —1,000,079 974 
—1,000,031 928 —1,000, 083 184 
—1,000,036 192 —1,000,084 300 
—1,000,039 877 —1,000,087 366 
—1,000,040 688 —1,000,088 372 
— 1,000, 043 192 — 1,000,091 342 
—1,000,047 508 —1,000,095 720 
—1,000,051 276 — 1,000,099 187 


Note that you need only count the pairs (a,b), since they determine c. Also, 
the inequalities force a < \/|d|/3. Thus the set of pairs to be considered is at 
most |d| in number. It is an easy matter to program a computer to find the class 
number of an imaginary quadratic field K with d near —10°. Table 3.2 gives 
some class numbers in this range. Shanks [586] used Gaussian composition of 
the quadratic forms ax? + 2bxy + cy” (which corresponds to multiplication of ideal 
classes) to speed the algorithm and found, for example, that if the discriminant is 
—4,722, 366,483, 281,962,074, 113, then hg = 50,866,650, 112. See also Cohen 
[95] and Stein [637]. 


Exercise 3.3.5. Make a table of class numbers of imaginary quadratic fields similar 
to Table 3.2. I wrote the program to do the table on a programmable calculator in 
the early 1970s. How times have changed. 


3.3.4 Dirichlet or Poincaré Polygon or Normal Fundamental 
Region 


There is another construction of the fundamental region. It could be called the 
perpendicular bisector method since the boundaries are formed by perpendicular 
bisectors of geodesics joining a point w and yw, y € T, where w is assumed not to 
be fixed by any element of I. The resulting fundamental domain is often called the 
Dirichlet (or Poincaré) polygon or the normal fundamental region. The name comes 
because Dirichlet used the method for Euclidean space in 1850. Poincaré then used 
it for hyperbolic spaces. See Beardon [31], Svetlana Katok [342], and Maass [437]. 
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Exercise 3.3.6 (Another Construction of a Fundamental Domain for I\H). 
Suppose that T is a discrete subgroup of SL(2,R) and T is infinite. 


(a) Show that T is countable. 
(b) Show that there is a point w € H which is not fixed by any element of T. 
(c) Let d(z1,z2) be the non-Euclidean distance between z; and z2. Set 


D={zEH | d(z,w) < d(z, yw) forall yeT}, 


where w is a point not fixed by any element of I’. Show that D is a fundamental 
domain for T. 


Hints. 


(a) Any neighborhood of the identity in SL(2,R) contains only finitely many points 
of T. 

(c) Using the invariance of the non-Euclidean distance under elements of I, it is 
easy to show that H is a union of images of D under elements of I’. Secondly, 
one must show that z and yz in D imply that z and yz lie on the boundary of 
D. Now the boundary of D consists of points z such that d(z,w) = d(z,Aw) for 
some A € T, while the inequality holds for other y € T. In fact, if we set Dy = 
{z | d(z,w) <d(z, yw)}, then Dy is a hyperbolic half-plane with a boundary that 
is the non-Euclidean perpendicular bisector of the geodesic through w and yw. 
Since D is the intersection of all the Dy, y €T, it is a convex set in the sense of 
the non-Euclidean metric. If z and yz both lie in D, then d(z,w) = d(z,y~'w) 
and z is a boundary point unless y = +/. For more hints, see Maass [437, 
pp. 12-15], or Siegel [596, Vol. II, pp. 35-38]. It turns out that D may not have 
a finite number of sides. If it does, then it has finite area and conversely (see 
Siegel [596, Vol. II, pp. 39-46]). 


We will call the fundamental region in Exercise 3.3.6 the Dirichlet or Poincaré 
polygon or normal fundamental region. In the special case T = SL(2,Z), the 
normal fundamental region with center iyọ, yo > 1, is just the fundamental domain 
in Fig.3.14. The non-Euclidean perpendicular bisectors of the lines joining iyo 
and +1 + iyo are the vertical lines bounding this fundamental domain. The non- 
Euclidean perpendicular bisector of the geodesic joining iyo and i/yg is the circle of 
center 0 and radius 1, the other boundary of the fundamental domain. We mentioned 
an application of the computation of this fundamental domain in Sect. 3.3.3. There 
is also an application to the theory of Riemann surfaces of genus | or elliptic curves. 
For this fundamental domain is in correspondence with the classes of conformally 
equivalent Riemann surfaces of genus | (see Maass [437, pp. 51ff]). 


Exercise 3.3.7. Construct fundamental domains for some congruence subgroups of 
SL(2,Z). These subgroups are defined by formula (3.51), which follows. 


The website of Helena Verrill has a wonderful fundamental domain drawing 
program 
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Second 
Kind 


First First 
Kind Kind 


Fig. 3.21 Fundamental domains for Fuchsian groups. (Reprinted from Lehner, [410, pg. 22], by 
permission of the American Mathematical Society) 


https://www.math.|su.edu/~verrill/fundomain/magmaFD html. 


3.3.5 Other Discrete Subgroups of SL(2,R) 


There is an overabundance of terminology for groups of fractional linear transfor- 
mations mapping the inside of a fixed circle onto itself. We owe this perhaps to the 
rivalry of Poincaré and Klein (see their letters in Acta Math. 39 (1923) or Rankin 
[533]). Poincaré called a group of fractional linear transformations mapping the 
inside of a fixed circle onto itself a Fuchsian group, and Klein called such a group 
a Hauptkreisgruppe (principal circle group). 

There is a finer distinction, according to whether all points of the principal circle 
are limit points of the group. By a limit point of T, we mean a point of accumulation 
of the set of centers of the isometric circles 


Ay={zeEC | |cz+d|=1}, for y= 6 h) e Tande 40. 


If all points of the principal circle are limit points, the group is called a Fuchsian 
group of the first kind in English and a Grenzkreisgruppe in German. One can 
prove that T is a finitely generated Fuchsian group of the first kind if and only if T is 
a discontinuous subgroup of the automorphism group of the interior of its principal 
circle having a fundamental domain of finite Poincaré area. For our purposes the 
word “discontinuous” is equivalent to discrete. See Maass [437], for the definition 
of discontinuous group. 


Exercise 3.3.8. Show that the modular group is a Fuchsian group of the first kind. 


Poincaré called a subgroup of SL(2,C) Kleinian if it had no limit circle. Klein 
apparently did not like this use of his name. Figure 3.21 shows some possible 
fundamental domains for Fuchsian groups. 
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We have already noted that the sides of our fundamental domain for SL(2, Z) 
are identified by the generators of SL(2, Z). This is also true in the general case of 
a fundamental domain with a finite number of sides (see Beardon [31], Svetlana 
Katok [342], and Maass [437, p. 26]). Poincaré reversed the process which creates 
the fundamental domain from a Fuchsian group and its generators. His theorem 
starts with a convex non-Euclidean polygon and its side-pairing maps and generates 
a Fuchsian group. See Beardon [31, Sect. 9.8]. 

Another important example of a discrete subgroup of SL(2,R) is the principal 
congruence subgroup I (N) of level N defined by 


ab 


=d= : 51 
cda) |4 d = 1(mod N) (3.51) 


a, b, c,d E€ Z, 
b=c=0(mod N) 


T(N) = ( 


A reference on generators of this group for N = prime is Frasch [184]. A subgroup 
T of SL(2,Z) is called a congruence subgroup if and only if I contains T(N) for 
some positive integer N. It was discovered in the 1880s that there are an infinite 
number of examples of noncongruence subgroups (see Maass [437, pp. 76-78]). It 
turns out that SL(n, Z) behaves quite differently for n > 3. For it has been proved 
by Mennicke as well as Bass, Lazard, and Serre that every subgroup of SL(n, Z) of 
finite index is a congruence subgroup, if n > 3 (see Bass [29]). Thus the congruence 
subgroup problem was solved for SL(n, Z). 


Exercise 3.3.9. Show that a fundamental domain for a subgroup T; of T = 
SL(2,Z) can be obtained by taking translates of a fundamental domain for T by 
representatives of the quotient ['/T); i.e., if D is a fundamental domain for T, then 
a fundamental domain for T is 


U D. 


yy el /T 


Exercise 3.3.10. Show that if r; = T(N), then it is a normal subgroup of T = 
SL(2,Z) and the quotient I /T; is isomorphic to SL(2,Z/NZ). The only nontrivial 
part of this problem is the proof that for each solution a,b,c,d € Z of ad—bc=1 
(modN), there are integers a; =a, bı =b, cy =c, dj =d (modN) such that 
aid, — b,c, =; 


It is also possible to show that the index of T(N) in SL(2, Z) is 


[SL(2,Z) : T(N) = M° J [0 - p~’), 
p|N 


where the productis over primes p dividing N. This result is proved in Shimura [589, 
pp. 21-22] as well as in Schoeneberg [562, pp. 74-75]. A fundamental domain for 
T (2) is pictured in Fig. 3.22. Since 


[SL(2,Z) :T(2)] = 6, 
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Fig. 3.22 A fundamental 
domain for '(2) 


Fig. 3.23 Fundamental 
domain for the theta group 


we can take our fundamental domain for I'(2) to be the union of six appropriate 
translates of a fundamental domain for SL(2,Z). More information on fundamental 
domains for I (N) can be found in Maass [437, pp. 71-73] and Schoeneberg [562, 
pp. 83-84]. See also Helena Verrill’s website. 

Recall from the formulas following Eq. (3.50) that the theta group is the group 
generated by z > z+2 and z+» —1/z. A fundamental domain for the theta group 
is pictured in Fig.3.23 (see Schoeneberg [562, pp. 84-86] and Maass [437, 
pp. 74-75]). 


3.3.6 Riemann Surface of the Fundamental Domain 


The final step in the study of fundamental domains for discrete subgroups of 
SL(2,R) is to note that they can be made into Riemann surfaces with the upper 
half-plane as a (usually branched) covering surface. 

First, we need to classify the fractional linear maps on the upper half-plane by 
classifying the corresponding elements of SL(2,R) according to their Jordan form 
type (over C). The Jordan form of such a matrix is one of the following: 
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(oa) œ (G5): 


The corresponding fractional linear map will be one of four types. 


The four types of fractional linear map 


(zz identity 
(2) z> cz, |c| =1, c #1; elliptic 

(3) z> cz, c>0,c 41; hyperbolic 
(4)z>z+a, a#0. parabolic 


In case (1), the element y of SL(2,R) is +7, I = the identity. In case (2), the element 
y is called elliptic. In case (3), the element y is called hyperbolic. In case (4), the 
element y is called parabolic. 


Exercise 3.3.11. Show that if o € SL(2,R), o A +I, 


(a) o is parabolic = |Tr o| = 2. 
(b) o is elliptic = |Tr o| < 2. 
(c) Is hyperbolic = |Tr o| > 2. 


More details on the classification of elements of SL(2,R) and SL(2,C) can be 
found in Schoeneberg [562, pp. 2—4], and Shimura [589, pp. 5-10], for example. 

A point z € H is called elliptic (with respect to T) if there is an elliptic element 
ofT fixing z. A point s of RU {æ} is called a cusp (with respect to T`) if there is a 
parabolic element of T fixing s. These will be the interesting points on the Riemann 
surface of a fundamental domain for I. The elliptic points in the fundamental 
domain for SL(2,Z) given in Fig. 3.14 are the points i= /—I, p = (—1+iV3)/2, 
and p + 1 = —p. The last two points are really identified under the action of 
SL(2,Z). The only cusp of this fundamental domain for SL(2,Z) is the cusp at 
infinity. 

For p in HU {o>}, let S, (the stabilizer of p) denote the group of maps z > yz 
which fix the point p, with y € I. For F = SL(2,Z), Sp = the identity, unless p is 
T— equivalent to ©, i, or p. In these three cases one has the following: S- is infinite 
cyclic generated by z++ z+ 1; S; is cyclic of order 2 generated by z++ —1/z; Sp is 
cyclic of order 3 generated by z> —1/(z+ 1). 


Exercise 3.3.12. Check the preceding statements about S, for T = SL(2,Z) and 
any p € H. 


The cusps and elliptic points often make trouble in calculations. We will 
experience this first-hand in our work on the Selberg trace formula. Thus some 
computations have been done for congruence subgroups T(N) rather than for 
SL(2,Z) itself, since these subgroups do not have elliptic points when N > 1 (see 
Exercise 3.3.13 and Fig. 3.22). 
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One can show that compactness of the fundamental domain I'\H, for I a discrete 
subgroup of SL(2,R) implies that T has no parabolic elements. Examples of such 
groups having arithmetic interest come from quaternion algebras (see the discussion 
at the end of this section as well as Montserrat Alsina and Pilar Bayer [4], Fricke 
and Klein [185, pp. 502-634], Gelfand et al. [203, pp. 116-119], Svetlana Katok 
[342], Magnus [446], or Marie-France Vignéras [699]). 


Exercise 3.3.13. Show that the congruence subgroups T(N) of SL(2,Z), with 
N > 1, have no elliptic elements. 


Hint. See Shimura [589, p. 22]. 


For many purposes one needs the compactification of the fundamental domain, 
thought of as a Riemann surface. For the modular group, we need only add the 
point at infinity. This point is called a cusp. Of course, a Riemann surface needs 
local coordinate patches at every point. The only problem in describing the local 
coordinates for the Riemann surface D* = DU {9}, where D is the fundamental 
domain of Fig. 3.14 for SL(2, Z), occurs at the elliptic points and cusps. At any 
point p € D* such that p is not an elliptic point or cusp, define the local coordinate t, 
by tp = (z— p)/(z—p). When p = i, define the coordinate to be t; = [(z—i)/(z+d]?. 
There is a double-branching here. When p = p = (—1 + iv3)/2 is the other 
elliptic point, define tp = [(z—p)/(z—p)]|*. There is a triple-branching here. 
Then finally when p = ©, define t = exp(2ziz). There is an infinite branching 
here. If you think about the fundamental domain and its identifications and try to 
draw neighborhoods of each point you will believe these “weird” coordinates are 
necessary and perhaps not even “weird.” More details can be found in Maass [437, 
pp. 30-37], Schoeneberg [562, p. 27], and Shimura [589, p. 18]. 

One can now compute the genus of the branched Riemann surface D* formed 
from the fundamental domain in Fig. 3.14 for SL(2,Z). The genus is the number 
of handles when you view the Riemann surface as a sphere with handles. Now the 
fundamental domain is a non-Euclidean triangle. Thus you can use Euler’s formula 
for the genus to see that D* has genus zero. It is also possible to use a formula of 
Hurwitz to compute the genus of fundamental domains of other discrete subgroups 
of SL(2,R) (see Svetlana Katok [342, p. 91], Lehner [410, Chap. 6], Maass [437, 
pp. 30-32], Schoeneberg [562, pp. 93-103], and Shimura [589, pp. 18—23]). 


Exercise 3.3.14 (The Leaky Torus) (from Gutzwiller [239]). Consider the fun- 
damental domain D in Fig. 3.24 with its edges identified by 


1 1 1—1 
a=(; and B= (| a 


(a) Show that the domain D, with sides identified as in Fig. 3.24, is equivalent to 
that given by Fig. 3.26, with boundary edges identified by A and B, plus 
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Fig. 3.24 Fundamental 
domain for the group in 
Exercise 3.3.14—the leaky 
torus. From Gutzwiller [239]. 
Reprinted by permission of 
Elsevier 


(HPO) tae. and C= aa 
0-1 1 0 


Does it bother you that the group I generated by A and B is not the congruence 
group I'(2) although the pictures of the fundamental domains in Figs. 3.22 
and 3.24 look the same? Can you explain how different discrete groups can 
have the same fundamental domain except for the boundary identifications? 

(b) Show that the genus of the domain D in Fig. 3.24 is 1, while that in Fig. 3.22 is 0. 
Thus the fundamental domain for I’ generated by A and B above is topologically 
a torus with one cusp. This is pictured as the lower surface in Fig. 3.25. On the 
other hand, the fundamental domain for I'(2) is topologically a sphere with two 
cusps, which is pictured as the top surface in Fig. 3.25. The Riemann surface for 
the fundamental domain of the modular group is pictured as the center surface 
in Fig. 3.25. 

(c) Does the group I’ generated by A and B above contain some congruence group 
T(N)? This would make T a congruence group of level N. 


Hint. See Schoeneberg [562] for hints on part (c). 


Gutzwiller [239, pp. 345-346], notes: “The leaky torus is topologically different from the 
ordinary torus which physicists like to use in Euclidean space. The four corners of the 
domain D become one point as usual, but this point is now infinitely removed. A path 
which goes around this exceptional point cannot be contracted to zero, because that would 
require moving it over an infinite distance.” 


3.3.7 Triangle Groups and Quaternion Groups 


There are discrete groups acting on H called triangle groups. A Schwarz triangle 
group (p,q,r) is generated by reflections in the sides of a geodesic triangle in H 
with angles m/p, /q, T/r. Such groups make sense in the Euclidean plane as well 
as the sphere. In order that the group be hyperbolic, it is necessary that l + 7 + i be 
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Fig. 3.25 Comparison of the surfaces for the leaky torus on the bottom, the fundamental domain 
of the modular group in the center, and the fundamental domain for I’(2) on the top. The x’s in the 
fundamental domain in the center represent the branch points 


less than 1. The modular group is the orientation-preserving elements of (2,3,29). It 
is a subgroup of index 2 in the triangle group. The nontrivial coset is represented by 
the map x + iy > —x-+ iy; i.e., the reflection in the vertical side. Figure 3.19 shows 
a tessellation of the unit disc for the modular group. The union of any shaded and 
white region gives a fundamental domain for the action of the modular group. Any 
individual triangle is a fundamental domain for the triangle group with (2,3,99). A 
Hecke triangle group is similarly the orientation-preserving subgroup of (2,¢,©), 
with q = 3,4,5,6,.... 
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Fig. 3.26 Another fundamental domain for the leaky torus 


Figure 3.18 shows a tessellation of the unit disc for the triangle group (2,3,7). 
The group (2,3,7) is the Klein quartic. The fundamental domain can be identified 
with the projective plane curve x*y+ y*z+2z°x = 0. See Clark and Voight [91, p. 39], 
who construct subgroups of the triangle groups they call congruence subgroups. 

A Fuchsian group of the first kind is called arithmetic if it is commensurable 
with a quaternion group coming from a quaternion algebra over a totally real 
algebraic number field of finite degree. Two subgroups A and B of a group such 
as SL(2,R) are said to be commensurable when ANB has finite index in both 
A and B. Takeuchi [652] has shown that there exist only finitely many arithmetic 
triangle groups up to SL(2,R)-conjugation. You can find Takeuchi’s list of 85 
arithmetic triangle groups in [653] as well as Bogomolny [46, p. 80]. If the 
numbers p and q are finite and r = ©, the only arithmetic triangle groups have 
(p,q,r) = (2,3,%), (2,4,0¢), and (2,6, °°). 

Surprisingly, one can tell whether a group T is arithmetic by looking at the 
spectrum of the non-Euclidean Laplacian A on I'\H. For example, Hejhal [273] 
found no even Maass cusp forms for nonarithmetic Hecke triangle groups, while the 
arithmetic groups do have even Maass cusp forms. And computations of physicists 
such as C. Schmit, show that the nearest neighbor spacings of eigenvalues of A on 
T\Æ for non-arithmetic groups T is that of random symmetric real matrices (often 
called GOE, for Gaussian Orthogonal Ensemble) while that for arithmetic groups I 
is Poisson (e~*). See Bogolmony [46, p. 80] and our discussion of random matrix 
theory and arithmetic quantum chaos in Volume II [667] (or [670, p. 343)). 

In the rest of this subsection we consider discrete subgroups of SL(2,R) coming 
from quaternion algebras. We are particularly interested in those groups with 
compact fundamental domains. References for this subject are Montserrat Alsina 
and Pilar Bayer [4], Gelfand et al. [203], Svetlana Katok [342], David Kohel and 
Helena Verrill [363], Marie-France Vignéras [702], and Voight [708]. 
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The fundamental domains that we are considering here are also called Shimura 
curves (once they are made into a Riemann surface).? Shimura curves are associated 
with an order in a quaternion algebra A over the rationals Q. A quaternion algebra 
is just a four-dimensional vector space over Q (or whatever field is your favorite) 
with a multiplication which is associative but not commutative. If you take the basis 
of A over Q to be {1,i, j,k}, then the multiplication in A is defined by requiring that 


i? =a, j? =b, k = ij = —ji, plus the usual rules for algebras; e.g., the distributive, 


associative laws. We write A = ($). 


Example 3.3.1. Think of the Hamiltonian quaternions. H = R 6iRG jR @ kR, 


2 2 2 yr š in ‘ _ f(-1-1 
where i4 = j4 =k* = —1, ij =k = — ji. This is the quaternion algebra H = ( ==). 


It can be shown to be a division algebra or skew field. That is, every non-0 
element q = qi + q2it+q3j+qak of H, with q1,q2,93,¢4 € R, has an inverse for 
multiplication given by g/Nrd(q), where the conjugate of q = 7 = q1 — qi — q3 j — 
qak and the reduced norm of q is Nrd(q) = 4:7 = q? +q} +q} +44. It is an Exercise 


to check that q - q' = q' -q and that g/Nrd(q) is the multiplicative inverse of a non- 
zero element q. It follows that the Hamiltonian quaternions have no zero divisors, 
which would be non-zero elements q € H such that there are elements w € V with 
qw = 0. 


If the quaternion algebra A is isomorphic to Q?*?, A is called split. In this case, 


A is not a division algebra since it has 0-divisors like (5 ) ; 


Exercise 3.3.15. Show that the quaternion algebra (4) , with the notation defined 


above, is split. 


For an element q = q1 + q2x + q3y + qaz, with q1,q2,43,q4 E Q and q € ($). 
where z = xy, x? = a and y? = b, define the conjugate of q to be F = q1 — q2xX— q3y — 
q4z, as before and then define the reduced norm Nrd(q) = qq, as in the Example 
of Hamilton quaternions. 


Exercise 3.3.16. Check that q-q! = q' -q and Nrd(q) = q:-G= 4} — qa — qyb+ 
2 
qgab. 


We can define a linear mapping from (+) into 2 x 2 matrices with entries in 


Q (Va) by defining @ on the basis of the algebra and extending by linearity: 


ott) =1, 0) = (4°). 


Ply) = ee oz) = e'a 


3The word “curve” may seem odd. We are supposed to think it is one-dimensional over C. 
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Exercise 3.3.17. (1) Show that, with the preceding definition of the Q-linear map 


@ on the basis of the quaternion algebra ($). if q = qı + q2x + gay + gaz, with 


91,92,93,94 EQ, q E (+), we get 


E at+qaVa qg+qıya 
= (6. eM ae 


(2) Show that ọ defines an algebra isomorphism from (#) onto a Q-subalgebra 


of the algebra of all 2 x 2 matrices with entries in Q (va). This means that you 
need to check @ preserves multiplication by scalars, vector addition and algebra 
multiplication, in particular. 


It is possible to do number theory in quaternion algebras A over Q (or some 
other number field) in a similar way to that in number fields except that the non- 
commutativity of multiplication throws a few monkey wrenches into the works. We 
have seen that there are still norms. One also has an analogue of the ring of integers. 
These are called maximal orders in A. An order 9 in A is a subring of A containing 
1 such that, as a Z-module, © is finitely generated and contains a set of 4 vectors 
that are linearly independent over Q. A maximal order is just an order that is not a 
proper subset of some other order in A. 

Fix a maximal order © in A. A unit u € © in A means that uw! is also in 9. 
Note that we mean that u~! is both a right and a left inverse. The units of reduced 
norm 1= U = {u|u,u~! € O,Nrd(u) = 1} is a subgroup of the unit group that can 
be identified (using @) with a subgroup T of SL(2,R). 


IfA= (4) , then p(A) = Q?*?. So we can take our maximal order 9 in A to be 
Z?**, Why is it maximal? The Shimura curve in the case that A = (4) is just the 
usual fundamental domain D of the modular group thinking of D as a Riemann 
surface with its sides identified, cusps added, and branch point neighborhoods 
understood as described previously. 

When the algebra A is not split, the corresponding T\HĦ is compact. The 
construction depends on the choice of © and the embedding of the units into 2 x 2 
matrices. 


Example 3.3.2. Here we just concentrate on one example from David Kohel and 


Helena Verrill [363, pp. 213-215]. They note that this example is a popular one in 


the literature. Let the nonsplit quaternion algebra be A = E) with generators x,y 


over Q and x? = 2,y? = —3,z = xy = —yx. Take O = Z (*) Z4 ()z p ZZ. 


Then we represent x,y,z by matrices as follows: 


x> ba yr er z> (a ae 
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Fig. 3.27 Fundamental domain for an example of a quaternion group from David Kohel and 
Helena Verrill [363] 


One must check that © is a maximal order in A. Kohel and Verrill find a presentation 
of the group of units of positive norm in © : generators 


7 = (x+2y-—z)/2, y= P 
% = (1+y)/2, n = (1 +3y-—2z)/2 J’ 


relations 


{Yi =%=%=% =U"wN = 1}. 


They obtain the fundamental domain given in Fig. 3.27. The labelled points are 


(-2v2+3) v=3 p (4v2-5) (3+2v23) 
3 l 21 


a= 
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(v2-1) (14+-v=2) 


, b=—-b,c =-F. 


C= 


The points d,d’ and e are defined by d’ = mb, d = mb’, and e = ma, where 
Te = 2y—z, which is an element of the normalizer of T. Then ọ (76) = 


ee. ri 


The Shimura curve corresponding to the preceding example is called xé). It is 
the fundamental domain of Fig. 3.27 with sides identified by the generators of the 
quaternion group in Example 3.3.2 and the neighborhoods of points given as we 
described earlier. One has a formula for the area of the fundamental domain F of a 
discrete group coming from a quaternion algebra of discriminant D given by 


area(F) = £ JJ(p- 1), 


p|D 


where the product is over primes dividing D which is the discriminant of the 
maximal order in the quaternion algebra. This is proved by Marie-France Vignéras 
[702] . For the example of Fig. 3.27, the discriminant is 6*. So we find the area is 
27/3. You can check this numerically. The genus of this Riemann surface is 0. Ihara 
proved that this Shimura curve (i.e., the Riemann surface) can be represented as the 
solutions of the equation X 24 Y? 437? = 0. See Kurihara [380]. 

The preceding results can be extended by replacing Q with any totally real 
algebraic number field. Shimura studied such groups since 1960 and viewed them, 
according to Montserrat Alsina and Pilar Bayer [4, p. xiii], as “moduli spaces 
of principally polarized abelian surfaces with quaternion multiplication and level 
structure.” 

What sort of mathematical software should one use to do computations with 
discrete subgroups of SL(2, R)? Helena Verrill uses Magma to compute fundamental 
domains. See her chapter on subgroups of SL(2, R) in The Magma Handbook [698]. 
Montserrat Alsina and Pilar Bayer use the Poincaré package in Maple to do their 
computations. 


3.3.8 Finite Upper Half-Planes and Their Tessellations 


We saw in Sect. 1.5 that it is useful to replace the real numbers R by a finite field 
F; and the plane R? with the finite plane F2. As in that section, we assume that q 
is odd throughout this discussion. So now we replace the Poincaré upper half-plane 
H with the finite non-Euclidean “upper” half-plane H; C F,(v5) where 6 is a 
nonsquare in the finite field F; (for q odd) and 
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Hy = {z=x+yv6| x,y € Fq, y#0}. 


Here our finite analogue of the complex numbers is F,(v5 )=F q2- We have all the 
usual algebraic rules for computing in the field of complex numbers. For example, 
define, for z = x + yV6, the imaginary part of z to be y = Jm(z). Define the 
conjugate Z = x — yv ô = 24, and the norm Nz = zz. 

Note that H4 is can also be viewed as the union of an upper half-plane and a 
lower half-plane. We do not have a good analogue of positivity in a finite field. So 
we are just thinking that all non-0 elements are positive. You may prefer to think 
positive elements in F, are the squares. 

We define a non-Euclidean “distance” between z,w € H, by 


N(z—w) 


aaa NNO 


(3.52) 


This is a natural finite analogue of the Poincaré metric for the classical hyperbolic 
upper half-plane. It is an exercise to show that d(z,w) is invariant under fractional 
linear transformation 


+b 
Zz ae with ic À € G = GL(2,F;) meaning that ad — bc £0. 
cz+d cd 


Exercise 3.3.18. Let K be the subgroup of G fixing võ. Show that 


K= { e a € c} oF, (V5)*. 


The subgroup K is a finite analogue of the group of real rotation matrices O(2,R) 
in GL(2,R). 


Hint. The mapping ọ giving the isomorphism with the multiplicative group 
F,(V5)* =F,(v5) — {0} is defined by setting 


o(s a =a+bv6, for a,b € F4. 
a 


Exercise 3.3.19. Show that (G,K), with G = GL(2,F,) and K as in the preceding 
exercise, gives a Gelfand pair and thus that H, = G/K is a finite symmetric space. 


In the finite upper half-plane we can think of the set {x+ yV5 | yeF,— of i 
for fixed x in F4, and its images under elements g € GL(2,F,) as finite analogues 
of geodesics. Similarly for fixed y in F}, define {x +yV6 | xE F,} and its images 
under elements g € GL(2,F,) to be finite analogues of horocycles. 
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f 


iS 
b 


Fig. 3.28 Two finite upper half-plane graphs. On the left is X3(—1,1) and on the right is the graph 
X5(2,1) which has edges given by the green lines. The magenta dashed lines on the right are the 
edges of a dodecahedron on whose faces the finite upper half-plane graph gives stars. Of course, 
the large bottom face has a very distorted star 


Fix an element a € F, with a # 0,46. Define the vertices of the finite upper 
half-plane graph X,(6,a) to be the elements of H,. Connect vertex z to vertex w iff 
d(z,w) = a. See Fig. 3.28 for examples of such graphs. 


Exercise 3.3.20. 


(a) Show that the octahedron is the finite upper half-plane graph X3(—1, 1). 

(b) Draw the upper half-plane graph X9(6,1), where ô is a nonsquare in Fo = 
F3 (V-T). 

(c) Show that X,(6,46) is a disconnected graph with edges connecting z and Z, for 
every z € Hy. 


Here we view finite upper half-planes as providing a “toy” symmetric space. 
The associated graphs give examples of Ramanujan graphs. An application to 
cryptography has also been found (see P.D. Tiu and Dorothy Wallace [681]). 

The GL(2,F,)-invariant operators on H, analogous to the non-Euclidean Lapla- 
cian on the Poincaré upper half-plane H are the adjacency operators on the finite 
upper half-plane graphs defined for a € F; by 


Aaf(z)= J, fw). (3.53) 
wEHq 
d(z,w)=a 


These operators generate a commutative algebra of operators. 


Exercise 3.3.21. Define finite upper half-plane graphs over the finite fields of 
characteristic 2. The spectra of these graphs have been considered in Angel [9] 
and Evans [167]. 
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It turns out (see Terras [668]) that simultaneous eigenfunctions for the adjacency 
operators of the finite upper half-plane graphs, for fixed 6 as a varies over F4, 
are spherical functions for the symmetric space G/K, where K is the subgroup 
fixing Vô. 

Let G = GL(2,F,). A spherical function h : G —» C is defined to be a K 
bi-invariant eigenfunction of all the convolution operators by K-bi-invariant func- 
tions; it is normalized to have the value 1 at the identity. Here K-bi-invariant means 
f(kxh) = f(x), for all k,h € K,x € G and convolution means f» h, where 


(fxh)(x)= ¥ fO)h (3.54) 


yeG 


These spherical functions are analogues of Laplace (zonal) spherical harmonics 
(see Chap. 2) as well as the Legendre functions h(ke~"i) = P_ Lait (coshr) of 
Sect. 3.2, Theorem 3.2.2. Equivalently h : H — C is a spherical function means A is 
a K-invariant eigenfunction for the adjacency matrices of all the graphs X,(6,a), as 
a varies over F}. One can show ( [668], p. 347) that any spherical function has the 
form 


Y, x(kx), (3.55) 


keK 


hy (x) 


“i 


where y is the character of an irreducible unitary representation 2 of G = GL(2,F,). 
Here m must appear in the induced representation (denoted p =Ind¢1) of the 
trivial representation of K induced up to G. This means p =Ind§1 comes from the 
linear action of g € G on the space V of functions V = {f : G/K =H, > C} via 
le (8) f] (z2) = f (8z), for z € Hg. In this situation (when G/K is a symmetric space 
or (G,K) a Gelfand pair) eigenvalues = eigenfunctions. See Stanton [621] and 
Terras [668, p. 344]. Thus our eigenvalues are also finite analogues of Legendre 
polynomials. 

When 7 is a principal series representation 2 of G, we can write the spherical 
function hz as a sum over K of the analogues of power functions À (Imz), for A a 
character of the multiplicative group F}. When 7 is a discrete series representation 
of G, the formula for the spherical function hz is more complicated. We need both 
types of spherical functions to obtain an orthogonal basis for the space of all K- 
bi-invariant functions, as well as all the eigenvalues of the adjacency operators 
on the finite upper half-plane graphs. Soto-Andrade [611] managed to rewrite 
the sum (3.55), for the case m a discrete series representation of G, as an explicit 
exponential sum which is easy to compute (see formula (3.56) below). Thus we can 
call the discrete series eigenvalues of the adjacency matrices for the finite upper half- 
plane graphs Soto-Andrade sums. Katz [346] estimated these sums to show that the 
finite upper half-plane graphs are indeed Ramanujan graphs as defined in Sect. 1.5. 
Li [419] gives a different proof. Both proofs require the Riemann hypothesis for 
curves over finite fields, which was proved by Weil. 
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Suppose that @ is a nontrivial multiplicative character of the subgroup U of 
* 
Fy (v5) consisting of elements ¢ such that Nt = t!*4 = 1. Let € be the character 


of the multiplicative group F* that is 1 on squares, —1 on nonsquares, and O on 
0. The Soto—Andrade formula for the spherical function associated to a discrete 


series representation of G if d (z V 5) = a and a £0 or 46, is given by: 


@tDhol)= FX e(2(u-1)+5) a(t), (3.56) 
tuo 


See [668, pp. 378 ff.] for a proof taken from Evans [168]. 

One can also view the eigenvalues of the finite upper half-plane graphs as entries 
in the character table of an association scheme. See Bannai [22]. And this whole 
subject can be reinterpreted in the language of Hecke algebras. See Krieg [373]. We 
will soon be discussing another sort of Hecke operator algebra. 

Fundamental domains for subgroups T of GL(2,F,) have been discussed by 
Shaheen [583] using the perpendicular bisector method. To do this, he needs to 
introduce an ordering of F,. Suppose that g € F, is a generator of the multiplicative 
group F}. Order the elements of F, as follows: 


O<lege pcg concept, (3.57) 


Of course this ordering will not have all the properties of the order in R. And you 
could easily invent other orderings. For example, you can view elements of F „2 as 
vectors (x,y) € F, x Fp. You could use the order in formula (3.57) for elements of 
F, and then the lexicographic order on the 2-vectors from F, x Fp. 

Now Shaheen defines two finite Dirichlet polygons 


Dr(zo) = {we H,| d(zo,w) < d(yzo,w), Yy ET, Y Æ cl, for some c € Fy}, 


and 


Dr(zo) = {w € Hy| d(zo,w) < d(yzo,w), VY ET, Y Æ cl, for some c € F4}. 
Shaheen proves that for any z € H4, there exists y € I such that yz € Dr(zo). He 
also shows that if z,w € Dr (zo) and yz = w for some y ET, it follows that y = cl, 
for some c € F}. This means that Dr(zo) has properties analogous to the Dirichlet 
polygon or normal fundamental region of Exercise 3.3.6. 


Exercise 3.3.22. Do the finite analogue of part (c) of Exercise 3.3.6. 


One can also consider I"—tessellations of the finite upper half-plane H,, where we 
assume q = p”, with r > 1, and I = GL(2,F,,). We used the finite fields package in 
Mathematica to produce Fig. 3.29, which represents a tessellation of a fundamental 
domain for GL(2,F\,) on H121. A fundamental domain for GL(2,F);) can be taken 
to be any set of 11 colored squares. 
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Fig. 3.29 Tessellation for GL(2,F 11) acting on the finite upper half-plane over the field with 11 * 
11 elements. A fundamental domain consists of points with 11 distinct colors. The figure is rotated 


90° and the white line down the middle denotes the analogue of the real axis, which is excluded 


from A 
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Exercise 3.3.23. 


(a) Compute the fundamental domain for GL(2,F5) acting on H25 and draw a 
tessellation of H25 given by this fundamental domain. 

(b) Use the orbit-counting lemma (often called the Burnside lemma) to show 
that GL(2,F,)\H,2 has p elements. This lemma says that if Fix(y) = 


{ z€H,,2 


y= ak; then 


= Lira. 


Ir\F,2 
yer 


3.4 Modular or Automorphic Forms-Classical 


The abundance of topics in this subject is so large that I have investigated only a few 
problems. I think that these facts could be put in order only by using notions from different 
arithmetical theories which are perhaps not sufficiently developed. 


—From Hecke [259]. 


3.4.1 Definitions, Eisenstein Series 


There are many books on classical (by this I mean holomorphic or meromorphic) 
modular or, more generally, automorphic forms. To the list at the beginning of 
Sect. 3.3, you can add Diamond and Shurman [133], Iwaniec [319], Kilford [350], 
Koblitz [359], Ogg [505,506], Sarnak [555], Stein [638], and Zagier [749], for 
example. Thus I do not feel compelled to write a book on this subject and we shall 
only present some of the classical theory in this section (and none of the modern 
adelic theory). Moreover, we will usually consider only the discrete group SL(2, Z). 
For a treatment from the point of view of adelic representation theory, see Bernstein 
and Gelbart [37], Bump [71], Gelbart [200], Goldfeld and Hundley [219], or Jacquet 
and Langlands [323]. 

There are also many websites. One such is http://www.lmfdb.org/ for the 
LMFDB, which stands for the database of L-functions, modular forms, and related 
objects. Another is http://modform.org. William Stein’s website has many useful 
things including books and computations with SAGE. Here we will use Mathemat- 
ica mostly. 


Definition 3.4.1. A function f : H — C is said to be an (entire) modular or 
automorphic form of weight k for the discrete group T = SL(2, Z) if 
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(1) f is entire on H; i.e., f(z) is holomorphic for every z € H; 
b\. 
(2) f(y) = (cz+ d) f(e), for any y= & r) inT; 


(3) f is holomorphic at infinity; i.e., f has the Fourier series expansion 


f(z) = ¥ anexp(2rinz). 


n>0 


Usually the word “modular form” refers to a function with an invariance property 
for SL(2,Z) or one of its subgroups while we call functions with invariance 
properties for more general discrete subgroups of SL(2, R) “automorphic forms.” In 
order to check (2), it suffices to check that f(z+ 1) = f(z) and f(—1/z) = ef (2). 
Property (3) is equivalent to the requirement that f be bounded in the fundamental 
domain for SL(2, Z). 

The vector space of holomorphic modular forms of weight k for SL(2,Z) 
will be denoted M(SL(2,Z),k). In order that this space be nonzero, the weight k 
must be a nonnegative even integer. To see that k must be even, replace y by —y 
in property (2). To see that k must be nonnegative, note that the function y‘/?| f(z)| 
is invariant under SL(2,Z). If k < 0, y*/? and thus y‘/?|f(z)| is bounded on the 
fundamental domain and thus on all of the upper half-plane. Thus | f(z)| < My~*/2 
for some constant M. But then the coefficients in the Fourier series for f satisfy 


1 
anexp(—2rny) = f f(x+iy)exp(—2rinx)dx. 


Thus |a,| < My—*/? exp(2mny). If k is negative, it follows that a, = 0 for all n, since 
you can let y approach zero (from above). 

In order to obtain nonconstant modular forms of weight k = 0, one must relax the 
definition and allow the function f to have poles. Automorphic forms of weight zero 
are called automorphic functions. For groups I with more than one cusp, one must 
consider Fourier expansions at each cusp in part (3) of the definition of automorphic 
form. And in order to call the theta function in formula (3.50) of Sect. 3.3 a modular 
or automorphic form of weight 1/2, one must introduce the concept of multiplier 
system in part (2) of the definition of modular form. A multiplier system v : IT — C, 
corresponding to the weight k, has the properties |v(y)| = 1 for all y € T, and 


vnn) (32+ d3)* = v(N)v(%)(e1 Hz + d1)*(c2z + d2) (3.58) 


for 


“= & a) , i= 1,2,3 and B= Nh. 
Ci di 


If k is an integer, then Eq. (3.58) reduces to v(y y) = vy (y). 
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We say that f : H — C is an automorphic form of weight k and multiplier 
system v for T if f satisfies the conditions (1) and (3) of the preceding defini- 
tion (perhaps modified somewhat as we mentioned above), and condition (2) is 
replaced by 


E) fe) =v(nN(ec+ ats fory= (74) er 


Here, when the weight k ¢ Z, one must fix the branch of (cz + d)* by assuming that 
zÆ = |z|te 8, a <argz<a, forz €C. 


There is an overabundance of terminology with respect to the word “weight.” 
Some replace k by —k (cf. Lehner [410, p. 268]). Some use the word “degree” (cf. 
Knopp [357, p. 12]). See also Serre [576] for slightly divergent notation, as well as a 
nice short survey of the subject of this section—a survey which we follow somewhat 
here. 

As an example of a modular form of weight k = 4,6, ..., consider the Eisenstein 
series G; defined by 


Gi(z)= Š, (mz+n\™*, k=4,6,8,.... (3.59) 


Exercise 3.4.1. 


(a) Set 
Teo = { F r) € s122} 
Oc 
Show that 
d k/2 
Gale) = 24) (2) 
yeTeo\SL(2,Z) K 


where ¢(k) is Riemann’s zeta function. The sum is over a complete set of 
representatives y € SL(2,Z) for the equivalence relation 


nN~%v iff y =yp, forsome yET.. 


(b) Show that G;(z) is a nonzero modular form for SL(2,Z) of weight k, if k = 
4,6,8, 


Hints. For the convergence of Eq. (3.59), compare the series with the Epstein zeta 
function in Exercise 1.4.5 of Sect. 1.4. To see that G; has property (2) of modular 
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forms, use part (a) and the chain rule. To see that G; 40 when k = 4,6,8, ..., show 
that 


lim Gx (iy) = 26(k), k= 4,6,8,.... 
ae 


Exercise 3.4.2. Obtain the complete Fourier expansion of the Eisenstein series Gx, 
k=4,6,8,10,.... 


Answer. 


Gx(z) = 26(K) 


n) exp(27inz), 
LaS 


where the divisor function is 


n)= £ d. 


0<d|n 


Here the sum is over the positive divisors of n. 


Hint. One method is to use the Poisson sum formula (see Maass [437, pp. 209-212] 
for a generalization). Another method starts with the partial fraction expansion of 
Tcot(7z) (see Shimura [589, pp. 32-33]). 


Exercise 3.4.3 (The Discriminant A). Define* the discriminant 
A(z) = (6064)? — 27 (140G5)”. 


Show that A is a modular form of weight 12. Then prove that A(cc) = 0. Feel 
free to use Exercise 3.5.7 from Sect.3.5. Graph a real-valued function related to 
A(z) with Mathematica or your favorite program. Compare with Fig.3.30 where 
a Mathematica density plot of (27)~!*y®|A(z)| is shown, using the relation A = 
(2x)!?1n4, involving the Dedekind eta fio defined below. See formulas (3.68) 
and (3.69) below. A website with many density plots of modular forms is that of 
Frank Farris. See the following page for density plots of various sums over the 
modular group produced by Frank Farris and Jeff Hoffstein: 


http://math.scu.edu/~ffarris/auto/auto.html 
The website 


http://math.scu.edu/~ffarris/EdgeofUniverse.pdf 


*Hopefully it will not be too confusing to use A both as the Laplace operator and as this modular 
form. 
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Fig. 3.30 This density plot of (27)~!?y®|A(z)| was produced with the Mathematica command: 
DensityPlot [Abs [(y%*6) «DedekindEta[x + I*xy]*24], {x, -2, 2}, fy, 
0, 2},ColorFunction -> Hue, PlotPoints -> 500] 


contains a color version of the Farris article [174] which did not appear in color on 
the M.A.A. website for the journal. 


A modular form that vanishes at infinity is called a cusp form (forme parabolique 
in French or Spitzenform in German). We define the space of (holomorphic) cusp 
forms of weight k, S(SL(2, Z),k), to be the modular forms of weight k that vanish 
at infinity. From Exercise 3.4.3, we see that A € S(SL(2,Z), 12). However, it is 
possible that A is identically zero. There are many ways to see that this cannot 
happen. Look at a table of Fourier coefficients of A such as Table 3.3, for example. 
After we have more information on the orders of zeros of modular forms we will 
learn that A does not vanish at any point in H. See the proof of Theorem 3.4.1. This 
means that you can divide by A and that is a useful fact for class field theory (see 
Borel and Chowla [53]). 

The facts about zeros of modular forms can be derived in several ways. First, if 
p is a point in the usual fundamental domain D for SL(2, Z) as in Fig. 3.14, then let 
n = np( f) denote the order of the zero of f at p; that is the integer n such that 
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Fig. 3.31 The contour C 
needed for the proof of 
formula (3.60) 


fA) =0 


Fig. 3.32 The arc A near a 
zero of f(z) 


f(z)(z — p)~" is holomorphic and nonzero at p. If f is in S(SL(2,Z),k), then 
np( f) =Nyp(f) for all yin SL(2, Z). The order of the zero of f at infinity is denoted 
n =Nn.o(f) and is obtained by looking at the Fourier expansion; i.e., 


f(z) = X, arexp(2mirz), an #0. 


rn 


Let e; denote the order of the group of maps z > yz (with y €T) fixing z 4 ©. 
Then e; = 1 when z is not I—equivalent to i or p = e2ni/3. ei = 2 and ep = 3. The 
fundamental formula for the orders of zeros of modular forms of weight k for 
SL(2,Z) is 


np(f) k 
no(f)+ ¥ = = D (3.60) 


peD ©P 


where D is the standard fundamental domain for SL(2, Z) (see Fig. 3.14). 

The easiest proof of this formula comes from the Riemann—Roch theorem (see 
Gunning [236, Chap. II], and Shimura [589]). This method also generalizes to 
higher dimensions (see Yamazaki [747]). However, in order to keep this section 
as elementary as possible, we will present a proof by contour integration. 

The proof will be carried out by integrating dlog f(z) over the contour C in 
Fig. 3.31, taking the contour to enclose all zeros of f in the fundamental domain 
minus the points i, p, p + 1, œ. Note that f has only a finite number of zeros in the 
standard fundamental domain because f is holomorphic at infinity, so zeros cannot 
accumulate there. 
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To do the contour integral over C, first note that if f(z) = (z — p)”g(z) with 
g(p) #0, then, when you integrate dlog f(z) over the arc A in Fig. 3.32 and let 
the radius r go to zero, you get mai. There is cancellation of the curved parts of 
the contour integral of dlog f(z) around the zeros of f on the vertical boundaries. 
And the curved part of the contour integral of dlog f(z) around i contributes —n,7i. 
Finally, the curved part of the contour integral around p contributes —np7i/3, with 
the same contribution for the curved part of the contour integral around p + 1. 

The limit of the part of the contour integral of dlog f(z) along the horizontal 
contour at the top of C, as the horizontal line moves up to infinity, is —27vin.., using 
the expansion of f(z) at infinity. 

The contour integral over the bottom of the contour C is obtained from the fact 
that f is a modular form of weight k, as follows: 


p 
f dlog f(z) = / dlog f(z) —dlogf(—1/z) 
bottom of contour i 


=f HS =x(5 Je 


Now, since the integrals over the vertical parts of the contour C cancel, one obtains 
the following formula for the number n of zeros of f(z) in the fundamental domain 
minus the points i, p, p + 1, œ: 


1 Ni np np k 
= _ l = Tovel . 
” A gfe) =-776 6 "t 


Here the zeros are counted with multiplicity. This completes the proof of for- 
mula (3.60). 
Formula (3.60) leads to the following theorem. 


Theorem 3.4.1 (Formula for the Dimension of the Space of Holomorphic 
Modular Forms for SL(2, Z)). If |x| denotes the floor of x; i.e., the greatest 
integer less than or equal to x, we have the following formula for the dimension of 
the space of modular forms of weight k for SL(2,Z) 


dy = dime M(SL(2,Z),k) 
0, if k<0 ork ¢ 2Z, 
1, if k=0, 

| S| +1, if k=0,4,6,8, 10(mod 12), for k > 0, 


LS], if k=2(mod 12), fork > 0 
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Proof. We have already proved the first line of the dimension formula (see the 
discussion immediately following the definition of M(SL(2,Z),k)). 
Using formula (3.60), we find that d} = 0, since 


is not solvable in non-negative integers n, Noo, Ni, Np. 
It also follows that 


dy < 1+ |k/12]. (3.61) 


For suppose that m > 1 + |k/12] and fi, ..., fn E M(SL(2,Z),k). Then there are 
constants c1, ..., Cm, not all of which are zero, such that 


m 
Y cif” (c) =0= for j=0,1,..., m—2. 
i=l 


Thus f = x” cif; is an element of M(SL(2,Z),k) with a zero of order > m— 1 at 
co, But then the formula for the orders of zeros of elements of M (SL(2, Z), k) forces 
a contradiction. The second line of the dimension formula follows from Eq. (3.61). 

We know from Exercises 3.4.1 and 3.4.2 that the Eisenstein series Gy, k = 
4,6,8,..., are nonzero elements of M(SL(2,Z),k). This implies that dy = 1 for 
k =4,6,8, 10. Then Exercise 3.4.3 implies that dı2 = 2, using formula (3.60) again. 
For we find from formula (3.60) that n(A) = 1 and n = nj = np(A) = 0. Thus A 
does not vanish at any finite point of H. Note here that A is not identically 0, since 
G4(p) = 0 = Go(i). 

To finish the proof of the dimension formula, note that every form in 
M(SL(2,Z),k) is a unique linear combination of forms Gx_12,A", for n > 
0, k— 12n > 4. To prove this, use the fact that there is a constant ag with 
(f — agGx) (cc) = 0. Since A is only zero at œ, you obtain (f — apG,)/A € 
M(SL(2,Z),k — 12). The proof is completed by induction. 


It follows from the preceding argument that every modular form of weight k is 
a unique linear combination of terms of the form G{G? , with nonnegative integers 
a,b such that 4a + 6b = k. The proof is in Serre [576, p. 89]. 

More general results on dimensions of spaces of automorphic forms with 
multiplier systems for SL(2, Z) can be found in Maass [437, p. 144]. Both Eisenstein 
series and Poincaré series (to be considered later in this section) are used to generate 
the spaces of such forms. The same result can also be found in Rankin [532, 
p. 208]. Both of these references show that there are exactly six different multiplier 
systems for SL(2,Z) for each weight k (see Maass [437, p. 132] and Rankin [532, 
p. 206]). Rankin obtains the basis elements of the form G, 12g Ste) 12. 0<s< 
(k—r)/12, with r = 0,4,6,8,10,14 if k > r. Again, it is important that A does not 
vanish at any point (not c) in the upper half-plane. If k < r, then there are no forms 
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but 0. If k Æ r(mod 12), the forms above are all cusp forms. A dimension formula for 
automorphic forms for more general discrete groups is to be found in Shimura [589, 
pp. 46-47]. The computation is a nice application of the Riemann—Roch theorem for 
curves. 

Next we want to consider some examples of modular forms—the discriminant A, 
the modular invariant J, theta functions, etc. 


3.4.2 The Discriminant, the Weierstrass s7-Function, 
and Ramanujan’s Tau Function 


The discriminant A(z) is defined by the formula in Exercise 3.4.3. It will be 
convenient to use the notation 


82 =60G4 and g3 = 140G¢. (3.62) 


The function A is called the discriminant because it is the discriminant of the cubic 
on the right-hand side of the equation 


u? = 4u? — g2U — 83, (3.63) 


satisfied by the Weierstrass elliptic function u = f2(t) (see Ahlfors [3, pp. 264-268] 
or Koblitz [359]). The cubic equation in formula (3.63) is an elliptic curve. We will 
say more about these curves later. 

Define the Weierstrass function f2(t) = ga(t,z) for t € C and z € H by 


1 1 1 
z7) = =+ i 3.64 
plt z) 12 mop ( (t Swe nz) (m ra nz}? ) ( ) 


The function g(t, z) is really associated to the period lattice L = Z ® zZ. Sometimes 
gis written ga(t,z) = ga(t;1,z) = (t; L). The only poles of a(t) are double poles 
at the lattice points. To see that f2(t) is doubly periodic with periods given by the 
points of the lattice, first observe that its derivative gJ (t) is clearly doubly periodic. 
Then argue as in Koblitz [359, pp. 17-18]. 

The differential equation for ¢a(z) shows that it is the inverse function for the 
elliptic integral in Weierstrass normal form, i.e., 


gz) 
z-0= | (4w? — gaw — g3) "dw. 
SA(z0) 


(see Siegel [596, Vol. I] and Lang [390]). The fact that the discriminant does not 
vanish on the upper half-plane shows that the cubic Eq. (3.63) has three distinct 
roots €1, @2, e3 such that 
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Table 3.3 Ramanujan’s tau function 


7A Es Eee 5 


Sa =P TR BLSA = SET 


s9'(z)” = 4(@(z) — e1)((z) — €2)((z) — e3). 


One can show (see Apostol [12, p. 13] that the roots are given by values of gg at the 


half periods: 
a= e2 = (5) e3 = f2 Sa 
I= zj’ 2 = 49 5)? 835 z J’ 


Then one can construct the function À called the modulus by 
A(z) = (e3 — e2)/ (e1 — e2). (3.65) 


It turns out that the function A is an automorphic function for the congruence 
subgroup ['(2) (see Ahlfors [3, pp. 277-279]), and gives a useful conformal 
mapping. We will consider À again later in this section. 

The Ramanujan t-function is defined on the positive integers as follows: 


(27)? A(z) = ¥ t(n)exp(2rinz). (3.66) 


n>1 


It is possible to use the Fourier expansion of the Eisenstein series given in 
Exercise 3.4.2 to see that the Ramanujan numbers t(n) are all integers. Table 3.3 
gives the first 11 of them. 

The Ramanujan conjecture states that 


|t(p)| <2p!'/? forall primes p. (3.67) 


This implies, using the multiplicative properties of the t-function, that 


It(n)| < n!! Poon) where  o(n)= Ù, 1, n=1,2,3,.... 


o<d|n 


The multiplicative properties of the t-function will be discussed in Sect. 3.6, using 
the theory of Hecke operators. Deligne proved the Ramanujan conjecture in the 
1970s as well as its generalization to forms of weight k as a consequence of the 
Weil conjectures in algebraic geometry (see Deligne [126, 127] and the expository 
article of Katz [344, p. 14]). The generalization is called the Ramanujan—Petersson 
conjecture (see Theorem 3.4.4 and the discussion following it). There are also 
analogues for nonholomorphic Maass waveforms which have not been proved yet 
(see the discussion of the tables in Sect. 3.5 as well as that following Theorem 3.5.3 
of Sect. 3.5). 
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Another question concerning T(n) is still open: Can T(n) vanish? The Lehmer 
conjecture says it cannot. In 1985 computer tables existed showing that t(n) 4 0 
for n < 10°. J. Bosman has proved that t(n) 4 0 for n less than a number close 
to 2 x 10!°. See Bosman’s article in Couveignes and Edixhoven [112]. Still no one 
knows how to prove the Lehmer conjecture (see Lehmer [406—408]). 

Write T(p) = 2p!!/ 2 cos Ọp. Inspired by the Sato—Tate conjecture for elliptic 
curves (see formula (3.78) below), Serre conjectured that the angles @, are 
distributed in [0, 2] with respect to the measure 


2 49 d 
— sin 7 
m pap 


Lehmer [409] did a numerical study. Serre [580] and Ogg [507] showed that the 
Sato-Tate conjecture would follow from nonvanishing properties of the analytic 
continuation of L-functions associated to the mth symmetric power representation: 


= 0-0 on i oy 1 ar app = p"! 
=) ae G=p 2 exp (ifp). 
The necessary results about L, (s) were proved for m = 1 by Hecke, for m = 2 by 
Rankin and Shimura, and for m = 3,4 by Jacquet and Shahidi. Additional references 
for this subject are Serre [579], Ram Murty [486, 487], Katz [345, p. 14], and 
Moreno and Shahidi [478]. Note that Lehmer’s problem mentioned in the preceding 
paragraph is the question, Can @, = 2/2? Nagoshi proved in 2006 that the Serre- 
Sato—Tate conjecture holds on average. See Nagoshi [489]. In 2011 the Serre—Sato— 
Tate conjecture was proved by Barnet-Lamb et al. [26]. The original Sato—Tate 
conjecture for elliptic curves has also been proved. See Clozel et al. [93]. 


Exercise 3.4.4. Another way to state the now proved Serre-Sato—Tate conjecture 
for the Ramanujan t-function goes as follows. Let 7 (x) be the number of primes < x, 
dp = 1(p)p 2 and [c,d] C [—2,2]. The conjecture says that the a, are distributed 
according to the semi-circle distribution, meaning that 


lim #{p <x | p prime, ap € -if V1—x? dx. 


x— 20 (x 


Check that the histogram for a large set of a, does resemble this semi-circle 
distribution. 


As noted in Sect. 1.5, Wigner (see [738]) had the idea in the 1950s of making 
a toy model of the Schrödinger operator in quantum mechanics using large real 
symmetric n x n matrices with entries that are independent Gaussian random 
variables. He found that the histogram of the eigenvalues looks like a semi-circle (or, 
more precisely, a semi-ellipse). This has led people in physics to call the distribution 
of the preceding exercise the “Wigner semi-circle distribution”—around the same 
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time that number theorists were calling it the “Sato—Tate distribution.” Wigner’s 
work was the beginning of a subject called “random matrix theory” and we will 
discuss it in Volume II [667]. 

Another useful fact about A(z) is Jacobi’s identity: 


A(z) = (2x) qJ — q") for q = exp(2ziz). (3.68) 


n>1 


Proofs of (3.68) can be found in Maass [437, p. 108], Ogg [505, pp. 1-43-44], 
Rankin [532, pp. 196-197], Serre [576, pp. 95-96], and Weil [727, Chap. 4], for 
example. 

The eta function of Dedekind is defined by 


n(z) =q" ” [[0 - 4"), q = exp(2riz). (3.69) 


n>1 


Like the theta function in formula (3.50), the eta function is a modular form of 
weight 53 but eta is a form for the whole modular group SL(2, Z). In particular, eta 
satisfies 


net 1)=n(z) and (-1/2) =(2/i)'?n(). (3.70) 


Note that, like theta, the eta function is an automorphic form with a multiplier 
system. If you can prove the last transformation formula in Eq. (3.70) for eta, then 
you can prove the product formula for the discriminant Eq. (3.68) since the product 
formulas imply that A = (27)!2774 and there is only one cusp form of weight 12. 
The transformation formula of eta is proved in many of the basic references on 
modular forms as well as in Pilar de la Torre and Goldstein [682] and Goldstein 
[224]. The complete multiplier system for eta involves Dedekind sums. We will 
have more to say about this in Sect. 3.7.7 on modular knots. 

One of the main applications of the eta function to number theory comes from 
the fact that 


qne = & p(n)aq", 


n>0 
where p(n) is the number of partitions of n; i.e., the number of ways of writing 
n=nt+::-+n,, nj EZ, nj >0, j=1,2,...,7 
Rademacher used this fact to obtain an exact formula for p(n) as an infinite series 
with terms involving certain finite trigonometric sums A;(n) and J-Bessel functions. 


The method is a further development of a technique of Hardy and Ramanujan from 
1917, which begins with the Cauchy formula 


1 
p(n) = z fr" Teds, 
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Fig. 3.33 A close up of (27)~!?y° |A(z)| produced in the same way as Fig. 3.30 


with C a closed curve inside the unit circle and containing the origin. Here f(q) = 
q'/*4n(z)~!. The path C is then taken as a circle around 0 and close to the unit 
circle. It is then cut into parts corresponding to neighborhoods of just one root of 
unity. For a discussion of the rather intricate arguments that ensue, see Rademacher 
(525, Chap. 14], or Lehner [410, p. 351]. The method is called the “circle method.” 
For some history of other work on formulas for p(n), see D.H. Lehmer’s review of a 
paper of Whiteman which can be found in the collection of reviews in number theory 
edited by Leveque [414, Vol. IV, pp. 510-511] or by visiting the Math. Reviews 
website. Selberg (age 19 or so) obtained a formula for A(n) as a finite Fourier 
series at around the same time that Rademacher had obtained his result. Lehmer had 
factored A(n) to make computations. 

The function 7(z) also appears in Kronecker’s limit formula which has many 
applications in number theory (see Exercise 3.5.6 of Sect. 3.5). The eta function 
and its generalizations have also been used to obtain class number formulas (see 
Goldstein and Razar [226]). 

Before leaving this section we look at Fig.3.33, which is a close-up of 
(27) ~!?y® |A(z)| produced in the same way as Fig. 3.30. 
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3.4.3 The Modular Invariant 


Our next example of a modular form is the modular invariant J(z) defined by 
J = (60G4)?/A. (3.71) 


This function was first constructed by Dedekind in 1877 and Klein in 1878. Here, 
once again, the definition seems mysterious, but we shall see that J is of fundamental 
importance, as is j = 1728J. The number 1728 gives j integer coefficients in its 
Fourier expansion. On the other hand, J has nice values at the elliptic and parabolic 
points (cusps) in a fundamental domain for SL(2, Z). 


Theorem 3.4.2 (The Main Properties of the Modular Invariant). 


(1) J(z) is a modular function, i.e., an automorphic form of weight 0, which may 
have poles. J is holomorphic in H with a simple pole at ©, J(i) = 1, and J(p) = 
0, where i= /—1 and p = (—1 + V—3)/2. 

(2) J defines a conformal mapping which is one-to-one from D onto C, where 
D is the fundamental domain for SL(2,Z) in Fig. 3.14. Thus J provides an 
identification of H /SL(2,Z) U {ce} with the Riemann sphere CU {co}. 

(3) The following are equivalent for a function f which is meromorphic on H U 
{%}: 

(a) f is a modular function (i.e., invariant under SL(2,Z)). 

(b) f is a quotient of two modular forms of the same weight. 

(c) f is a rational function of J, (i.e., a quotient of polynomials in J) and thus J 
is called the Hauptmodul or fundamental function. 


Proof. (1) This property is clear from the properties of the discriminant and the 
Eisenstein series. The last two statements are proved by noting that G4(p) = 0 
and G6 (i) = 0, from formula (3.60). Thus A(i) = [60G4(i)]>. 

(2) To see this property, use formula (3.60) to show that f; = G} — sA can only be 
zero at one point. 

(3) To prove this property, first note that clearly (c)=(b)= (a). In order to prove 
(a)=(c), suppose that f(z) is a modular function. Then multiply f by a 
polynomial in J(z) to make a holomorphic function we will still call f on 
H. Then g = A”f is holomorphic at infinity for some n > 0, to put g in 
M(SL(2,Z),12n). Thus g is a linear combination of G4G2, with 4a+ 6b = 12n. 
It suffices to show (c) for f = G4G2/A". Now p =a/3 and q = b/2 are integers 
and f = G}P G4 /AP+4=(G} /A)” (G2/A)‘. Finally, it is straightforward to show 
that G} /A and G2 /A are rational functions of J, completing the proof. 


Exercise 3.4.5. Fill in the details in the proof of Theorem 3.4.2. 
Exercise 3.4.6. Produce some graphs of J(z). Compare with Fig. 3.34. 
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Fig. 3.34 A density plot of the argument of J(z) produced with the Mathematica command: 
DensityPlot{Arg[KleinInvariantJ[x+I y]],{x,-1,1},{y,0.2,1.5}, ColorFunction->Hue, PlotPoints- 
>400] 


One can use the automorphic function J to prove the small Picard theorem very 
quickly. The inverse function w = J~!(z) is infinite valued with branch points at 
0,1, œ (the values of J at the vertices of the fundamental domain). Suppose that f is 
an entire function omitting the values 0, 1, œ. Then J~! o f = g is holomorphic in the 
z-plane and single valued. Since g lands in the upper half-plane, exp(ig) is bounded 
and thus constant by Liouville’s theorem. It follows that f must be constant. 

Still another, short proof of the little Picard theorem can be found in Kobayashi 
[358]. Moreover, Kobayashi’s methods generalize to prove the big Picard theorem 
in higher-dimensional cases. 

Another application of the J-function is in number theory. For the maximal 
unramified abelian extension of an imaginary quadratic number field is produced 
by a value of j, also called the modular invariant. An explicit reciprocity law 
shows how the Galois group of the extension is isomorphic to the ideal class group 
of the quadratic field. This is explicit class field theory (see our discussion of 
number fields in Sect. 1.4). In the ramified case, other functions than j are needed. 
A reference for this is Borel and Chowla et al. [53]. 
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Martin Gardner wrote a famous April Fool’s column for the April 1975 edition 
of Scientific American in which he claimed (among other bogus claims) that 


e”V 163 is an integer. Of course e7V163 is transcendental by the theorem of Gelfand 


and Schneider (see Baker [21]). The fact is that j( ees v1) € Z. The Fourier 


q-expansion of j(z) in formula (3.72) implies then that eV 16? is within 12 decimal 


places of an integer. Gardner noted later in the Scientific American blog: “To my 
amazement, thousands of readers failed to recognize the column as a joke.” 
It turns out that the Fourier or g-expansion of j has only integer coefficients: 


(2) =q! +744 + 1968849 + 214937609 +- , (3.72) 


where q = exp(27iz). This is useful in the applications to number theory. There is 
also a surprising connection of the coefficients in Eq. (3.72) with the representations 
of the largest of the 26 sporadic finite simple groups which is known as the 
Fischer-Griess monster group M. The monster group has approximately 8 x 1053 
elements. It happens that all of the early Fourier coefficients in Eq. (3.72) are simple 
linear combinations of degrees of representations of M. This was first noticed by 
John McKay and John Thompson. Some interesting analogues for Lie groups are 
explained by Kac [334]. See also Conway [105] and Conway and Norton [106]. 
Conway reported on the situation as follows: 

Because these new links are still almost completely unexplained, we refer to them 

collectively as the moonshine properties of the MONSTER, intending the word to convey 

our feelings that they are seen in a dim light, and that the whole subject is rather vaguely 
illicit! 
In 1998, R. E. Borcherds won the Fields Medal in part because he proved the 
Conway-—Norton “moonshine conjecture.” See Borcherds [49]. 

The modular invariant j is important to algebraic geometers because it charac- 
terizes elliptic curves (see Shimura [589, p. 97]). We will say more about elliptic 
curves later in this section when we discuss elliptic integrals. The reason that the 
modular group is so called is that the moduli variety classifying elliptic curves is 
SL(2,Z)\H. 


3.4.4 Theta Functions 


The theta function is an important example for us (like G} and unlike J and 
A) because generalizations to higher dimensions are known. The theta function 
associated to a positive definite symmetric n x n real matrix P and a point z in 
the upper half-plane is 


0(P,z) = > exp(ziPlalz). (3.73) 


acz” 
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Here P[a] denotes the quadratic form associated to P; i.e., Pla] = ‘aPa, thinking of a 
as a column vector in Z”, as usual. Note that if z = it, for t > 0, them 0 (P,z) = 0 (P,t) 
as defined in Exercise 1.4.6 of Sect. 1.4. Note also that ifn = 1, P = 1, @(1,z) = 
0 (z), the function defined by formula (3.50) in Sect. 3.3. 


Exercise 3.4.7. Prove the transformation formula 


(pz) = P (E) ee, -1/3 


Hint. Recall Exercise 1.4.6 of Sect. 1.4. 


In order for the theta function defined by Eq. (3.73) to be a modular form as a 
function of z (with trivial multiplier system for the group SL(2, Z)), we need to place 
restrictions on P. 


Theorem 3.4.3. We say the positive definite symmetric n x n real matrix P is even 
integral if P|a| € 2Z for all a € Z”. If 0(P,z) is defined by formula (3.73) and P is 
even integral of determinant 1, then 8 divides n and 0(P,z) is in M(SL(2,Z),n/2). 


Proof. First we need an exercise. 


Exercise 3.4.8. Show that under the hypotheses on P in Theorem 3.4.3, the size n 
of the matrix P must be divisible by 8. 


Hint. (See Serre [576, pp. 53 and 109].) You can assume that n = 4(mod8) by 
replacing P by 


P000 

PO 0P00 
or 

OP 0o0PO 

000P 


Then let V = ST, with Sz = —1/z and Tz = z+ 1. Compute 0(P,V (iy)) for y > 0, 
as well as @(P,V° (iy)). Obtain a contradiction from V? = I. 


From Exercises 3.4.7 and 3.4.8, it follows that 0 (P,z+ 1) = 0 (P,z) and 0(P,z) = 
z"/?@(P,—1/z), if P satisfies the hypotheses of Theorem 3.4.3. For if |P| = 1 and 
P is integral, then, upon setting 


Sp = {P]a] |a € Z"}, 


we see that Sp = Sp-1. This is derived from the fact that we can write P~! [b] = Pla 
for b = P [a] € Z”. 


We call two positive definite real n x n matrices P and Q equivalent modulo 
GL(n,Z) if there is a matrix A in GL(n,Z) = the integral n x n matrices of 
determinant +1 or —1, such that P = Q[A] = ‘AQA. In fact, this equivalence relation 
will be quite important in Volume II [667] as an analogue of the equivalence relation 
defining the quotient space SL(2,Z)\H. 
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Exercise 3.4.9. Show that the following 8 x 8 matrix, which comes out of the 
theory of the exceptional Lie group Eg, gives an example of a matrix satisfying 
the hypotheses of Theorem 3.4.3. It can, in fact, be proved that this is the only 
possible example, up to equivalence modulo GL(8, Z). 


ooooroNnNo 


| 
ocooorFN OF 
l 
eooOoOrRNrFRF OS 
l l 
corne ooo 
l l 
l 


OorRNrR OCC Oo 


ooooco cron 
ePNrF COCO CO Oo 
Nr oooocne 


For n = 16, there are two examples of matrices satisfying the hypotheses of 
Theorem 3.4.3 and nonequivalent modulo GL(16,Z). For n = 24, there are 24 
examples nonequivalent modulo GL(24,Z) (see Serre [576, Chaps. 5 and 7]). 

If one considers relaxing the hypotheses on P in Theorem 3.4.3; e.g., allowing 
P to have arbitrary determinant, then one obtains a modular form attached to a 
congruence subgroup 


To(N) = { E r) é s1(2.2) Naivides e}. 


The multiplier system for such theta functions will not be trivial. The theory is 
harder for odd n, even if n = 1, which is the most widely used special case—the 
theta function corresponding to the Riemann zeta function by Mellin transform. 
The level N of the congruence subgroup is called the level of the quadratic form P 
and defined to be the least positive integer such that NP~! is even integral. For n 
even, it was proved by Hermite that 


ab 


O(P, yz) = €(d)(cz+d)‘0(P,z) for y= o p 


) ETo(N), k=n/2, 
where the multiplier system is given by 


_1)k 
e(d) = (sign d) (E) À (3.74) 


The symbol (—) in the multiplier system is the Kronecker symbol, which is a 
generalization of the Legendre symbol. The Legendre symbol (4) is defined, for 


a, p integers with p an odd prime not dividing a, by 
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(2) = if a = x° (mod p) has a solution x € Z, (3.75) 


P —1 otherwise. 

The definition of the Kronecker symbol can be found in most elementary number 

theory texts, and, for example, in Cohn [97], Grosswald [233], or Siegel [598]. 
The quadratic reciprocity law, first proved by Gauss (eight times), says that if 

p and q are distinct odd primes then 


QO- on 


It is possible to prove the quadratic reciprocity law (one of the most basic truths in 
number theory) using theta functions (see the next exercise). Hecke generalized this 
argument to obtain the quadratic reciprocity law for algebraic number fields, using 
theta functions (see Hecke [260] and Siegel [600, Vol. III, Paper 74]). Proofs of 
formula (3.74) can be found in Eicher [150, pp. 48-52], Ogg [505, Chap. 9] and 
Schoeneberg [562, Chap. 9]. The proof given by Eichler is quite interesting because 
it uses theta functions on the Siegel upper half-space (see Volume II [667] for an 
introduction to such Siegel modular forms). Moreover, Eichler’s proof works for an 
odd number of variables. The latter case is due to Pfetzer [513]. There is also a very 
interesting article by Eichler [151], which includes another proof of the result for 
an even number of variables plus some history and connections with other areas; 
e.g., the representations of SL(2,Z/nZ). Actually, one must generalize the theta 
functions defined in this section and introduce spherical harmonics in the sums (the 
same spherical harmonics that we considered in Sect. 2.1) in order to obtain cusp 
forms and bases for spaces of modular forms for Toọ(N) (see also Hijikata et al. 
[295], Serre and Stark [581]). For other discussions of theta functions and multiplier 
systems, see Knopp [357], Maass [437], Mumford [485], and Rankin [532]. 


Exercise 3.4.10. Prove the following formula of Landsberg and Schaar: 


ji 221 ( 24) ein/4 247! (r) 
— ) exp| 2min — } = exp ; 
P n=0 p) v24 2, 2q 


for any integers p and q. 


Hint. Use the transformation formula for theta (for z € H): 


X exp(inn’z) = iy? X exp(ian”(—1/z)). 


neZ neZ 


Let z = 2q/p + it and let t approach zero. It is possible to use this exercise to 
prove the quadratic reciprocity law (see Dym and McKean [147, pp. 222-226], 
and Bellman [33, Sect. 29]). 
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For the applications to elliptic integrals, it is useful to define slightly more general 
theta functions, similar to that in Exercise 1.3.8 of Sect. 1.3. These functions will be 
studied in the following pair of exercises. There are multitudes of theta functions, 
and the notation for these functions is definitely not standardized. So be careful if 
you consult references. Here are some books to consult, in addition to our standard 
list from the beginning of Sect.3.3 and the books just mentioned: Bellman [33], 
Erdélyi et al. [164], Hurwitz and Courant [312], Igusa [313], Lion and Vergne 
[423], Rauch and Lebowitz [536], and Whittaker and Watson [734]. Of course, some 
of these theta functions are known to Mathematica. For example, the Mathematica 
command EllipticTheta[a,u,q] gives the Jacobi theta function 0,(u,q) in 
Exercise 3.4.12. 


Exercise 3.4.11 (Another Theta Function). Define 


d(z,s)=> exp (mizn* + 2risn) , forze€H and seEC. 


neZ 


Prove the following statements about this theta function. 

(a) &(z,s) is an entire function of z € H and of s € C. 

(b) &(z,s) is quasi-periodic as a function of s, in the following sense: 
O (z,s +n) = B(z,5), forall ne zZ; 


V(z,s+zn) = exp |- ri (zn? +2ns)] (z, s), forall ne Z. 


(c) &(z,s) satisfies the transformation formula 


dai 
v (z,s) = E J ep (EE i 


neZ 


(d) V (z, (1 +z)/2) = 
(e) The only zero of V (s) = V (z,s) as a function of s, holding z fixed, for s in the 
period parallelogram on 1 and z is s = (1 +z)/2. Moreover, this zero is simple. 


Hints. For (c) recall Poisson summation. For (e) imitate the proof of formula (3.60). 
This theta function is considered by Siegel [596, Vol. II, pp. 158—172] and Mumford 
[485, Vol. I] along with generalizations to the Siegel upper half-plane. 


Exercise 3.4.12 (Jacobi’s Four Theta Functions). Define for q = exp(ziz),z€ H, 


(u,q) =25(- y gina) sin{(2n + 1)u]; 


02(u,q) = 2 > git a) cos[(2n + 1)u]; 
n=0 
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(uq) =1+2) q” cos[2nu]; 
n=1 


oo 


Q4(u,q) = 1+2 ¥ (-1)"q" cos[2nu]. 


n=1 


Prove that 
u 1 
63 (u,q) =v (z =) > 04(u,q) =v (£4 5) $ 


T , Tiz TZ 
@(u,q4)= 01 (u+ Z,a) ; 0i (u,q) = —iexp (u+ =) 04 (u+ =a) . 


We will show later that you can use products and quotients of the theta functions 
in Exercise 3.4.11 to obtain any elliptic function (see the discussion of elliptic 
integrals which follows). This aids in the computation of elliptic functions, because 
the series for theta functions converge rapidly, when the variable z is away from the 
real line. And you can use the transformation formula in part (c) of Exercise 3.4.11 
to deal with other values of z. Siegel extends these results to generalizations of 
elliptic functions called Jacobi-Abel functions in [596], using Siegel modular forms 
to be considered in Volume II [667]. The Jacobi-Abel functions are related to abelian 
integrals (see Sect. 3.4.5). 

Set 


011 (z, s) = exp [37+ ri(s + 5| v (es+ ta +2) : 

Mumford [485, Vol. I, p. 25] shows that, up to a constant, the Weierstrass sa-function 
is obtained from 1; (z,s) by taking the logarithm of theta and then two derivatives 
in the s-variable. We should caution the reader that our notation is the mirror image 
of Mumford’s. Mumford [loc. cit., p. 64] proves Jacobi’s derivative formula which 
shows that taking derivatives of 4; (z,s) with respect to s and then evaluating at 
s = 0 leads to cusp forms of weight 3 and level 4 (after squaring the result). And 
Mumford [loc. cit., p. 72] shows that A(z) can be written as a 24th power of a value 
of 0(z,5). 

Theta functions are viewed by algebraic geometers as providing “nice projective 
embeddings of a polarized abelian variety” (see Igusa [313] and Mumford [485]). 
There are connections between this view and the application to elliptic integrals 
which follows. A historical sketch explaining some of these things can be found in 
Shafarevitch [582, pp. 411-430 and Chap. 11]. Another reference is Hoobler and 
Resnikoff [300]. Much of the modern algebraic-geometric theory of theta functions 
(of higher genus) has been developed by Mumford (see Tate [657] and Mumford 
[482-485]). The higher-genus theta functions will be considered in Volume II [667]. 
The book of Mumford [485] is recommended as a nice introduction to theta 
functions which includes several applications to the solution of partial differential 
equations. 
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There are numerous applications of theta functions in number theory; e.g., in 
order to obtain correspondences between various spaces of automorphic forms 
(see Kudla [377,378], Stark [628], and Marie-France Vignéras [699]), to study 
representations of integers by quadratic forms and related questions (see Exer- 
cises 3.4.19, 3.4.20, and Siegel [600, Vol. I, pp. 326—405, 410-443, 469-548; 
Vol. II, pp. 1-7, 20-40]). Siegel’s work on quadratic forms finds a new adelic 
and representation-theoretic formulation in Weil [729, Vol. HI, pp. 1-157] and 
Tamagawa (see Kneser’s article in Cassels and Frohlich [79]). 


3.4.5 Elliptic Integrals and Elliptic Curves 


The name “elliptic integral” arises because the arc length of an ellipse is such an 
integral. An elliptic integral is an integral of the form f R(x,y)dx, where R(x,y) is 
a rational function over C (i.e., a quotient of polynomials with complex coefficients) 
and x and y are related by an algebraic equation which can be solved by substitutions 
x= f(u) and y = g(u), with f, g being elliptic functions; e.g., y = G(x), where G(x) 
is a third- or fourth-degree polynomial over C. Here an elliptic function means 
a meromorphic function of z € C which is doubly periodic; i.e., has two linearly 
independent periods over R. The best example is the Weierstrass f2-function of 
formula (3.64). 

Abelian integrals are generalizations of elliptic integrals, where the equation 
relating x and y is of higher degree. If the equation is y? = G(x), where G(x) has 
degree greater than 4, the integral is hyperelliptic. 

Elliptic functions arise in many ways in applied mathematics; e.g., in potential 
theory on rectangles, in problems involving pendulums, and in problems involving 
ellipses (see Jeffreys and Jeffreys [327, Chap. 25]). Perhaps one of the strangest 
things about elliptic functions is that, unlike the well-known functions of freshman 
mathematics, they satisfy nonlinear differential equations (e.g., see formula (3.63) 
for the Weierstrass ga-function, Exercises 3.4.14 and 3.4.21). 

Elliptic integrals arise, for example, when one uses the Schwarz—Christoffel 
formula to obtain the conformal mapping of the rectangle in Fig. 3.35 onto the upper 
half-plane. Here the points are mapped according to Table 3.4 and sn’ (0) = 1. 

The integral defining the inverse function for the Jacobian elliptic function w = 
sn(z,k) = sn(z) is 


z= f -w0 -n?) "dw. (3.77) 
0 


Note that the case k = 0 gives the ordinary trigonometric function w = sinz. 

The Jacobian elliptic function is a doubly periodic function with periods 4K and 
2iK’. The modulus k? (as well as k with 0 < k < 1) is uniquely determined by 
q = exp(int), T= iK/K’. In the theory of elliptic integrals, k? was always called 
the modulus and it turns out that 


3.4 Modular or Automorphic Forms-Classical 249 


z-plane 
iK' 
-K + ik’ K +iK' 


w =sn(z) w-plane 


-=K 0 K —1/k | 0 l I/k 


Fig. 3.35 Conformal mapping by sn(z) 


Table 3.4 How points in the rectangle of Fig.3.35 are 
mapped by the Jacobian elliptic function 


where A was defined in formula (3.65). This explains why we call À the “modulus.” 
References for these things are Copson [109, Chap. 14], Du Val [146], and Nehari 
[492, pp. 280-296], for example. 

The elliptic function sn(z) is used in applied mathematics to produce the potential 
distribution inside a rectangle when there is some kind of charge distribution on the 
boundary (see Morse and Feshbach [479, Vol. II, p. 1252]). Reading this page in 
Morse and Feshbach, one is struck by the regret with which they turn to higher- 
dimensional problems from those of the complex plane, for higher-dimensional 
problems seldom have such elegant solutions. The method of conformal mapping 
mostly fails. For Liouville proved that the only nontrivial conformal maps in 3-space 
are “mappings by reciprocal radii” (inversions in spheres)—a fact also bemoaned by 
Sommerfeld [610, p. 140]. 


Exercise 3.4.13. Use the Schwarz—Christoffel transformation to show that the 
Jacobian elliptic function has the properties that we claimed above. 


Exercise 3.4.14. Show that w = sn (z) satisfies 


(w)? =(1-w*)(1—- Pw’). 


Exercise 3.4.15. Show that if q =e", 


Bey An ( Fe). 


250 3 The Poincaré Upper Half-Plane 


Hint. Compare zeros and poles of the two doubly periodic functions. The functions 
0; and 04 are defined in Exercise 3.4.12. 


Exercise 3.4.16. Show that the constant A in Exercise 3.4.15 has the value 
oO) (0, q) 


7 03 (0, q) ` 
Also show that 


an 


Since the theta series converge rapidly, these formulas can be used to evaluate the 
functions above on a computer. 


There are many other known and useful formulas for these functions. We 
mention, for example, the relation between the modulus and J: 


_ 4 (1-A+A?)3 
~ 27 I-A ` 


Jacobi’s triple product formula says that if z = exp(2iu) 


oo 


03(u,q) = [[0 -PAUA H). 


n=1 


For a proof, see Hurwitz and Courant [312], for example. This and related identities 
for the Dedekind eta function have attracted much attention recently thanks to 
connections with Lie groups (see Kac [335] and MacDonald [441]). For more 
connections between theta functions and the theory of representations of Lie groups, 
see Lion and Michelle Vergne [423] and Mumford [485]. 

The words “elliptic curve” have arisen many times in this section. Let’s now seek 
to understand a bit of the basic facts about them. The most concrete way to define an 
elliptic curve over C is as the points (x,y) € C? satisfying the equation y? = ax? + 
bx? + cx +d provided that the discriminant of the cubic polynomial on the right is 
non-zero. The equation is said to be in Weierstrass form if it is y? = 4x? — g2x— g3 [as 
in formula (3.63)]. Suppose that L is the lattice Z@ TZ and go(z) = f2(z, T) = (z, L), 
with the Weierstrass function ¢2(z) defined by formula (3.64). Then the mapping that 
sends z € C? /L to (f(z), 7 (z)) gives an identification of the torus C? /L with points 
on the curve y? = 4x? — gox — g3. This allows one to add points on this curve by 
adding the corresponding points on the torus. That leads to the group of the curve. 
There is also a way to add points on a curve directly which is useful if you want 
to replace C by a finite field. The curve will have to have a well-defined tangent at 
every point since to add a point P to itself you need to intersect the tangent at P with 
the curve. The curve in Fig. 3.36 illustrates how to add point A to point B over the 
reals. 
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Fig. 3.36 A picture of the addition A + B = C on the elliptic curve y? = x — x where the field is R 


Once you have defined elliptic curves over C or R or Q, you can replace these 
fields by finite fields. Then it is possible to apply elliptic curves to cryptography. 
See Koblitz [360]. It is useful then to know some basic theorems about this case. 
For example, Hasse’s theorem says that if p is a prime and N, is the number of 
points on an elliptic curve over the finite field F,, then IN, p— (p+ 1)| < 2,/p. Thus 
we can write p + 1 — N, = 2,/pcos 0p. Independently around 1960 Mikio Sato and 
John Tate made a conjecture. The Sato—Tate conjecture for elliptic curves without 
complex multiplication says: 


b 
# rime <xanda<0,<b 2 
tim #1? Prime | p<xanda< 0) <b f _ = | sin? udu, (3.78) 


x> #{p prime | p <x } 


a 


This has been proved by Clozel et al. [93]. 

The popular groups for public-key cryptography using the RSA system were 
once the multiplicative groups of units (or invertible elements for multiplication) in 
the ring Z/pqZ, where p and q are very large primes. The difficulty of factoring 
the now very very large pq, makes the discrete logarithm problem in this group very 
very hard. However, in 1985 Neil Koblitz and Victor Miller proposed cryptosystems 
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based on elliptic curves. The groups of elliptic curves are the groups behind much 
of the encryption in use at the moment. They have the advantage of being much 
smaller than the groups needed for RSA cryptography. They make up much of the 
protection for your passwords on your Internet transactions. See Kristin Lauter’s 
article [397] for a brief introduction to the state of the art as of 2004. Koblitz [361] 
discussed the “drama and conflict ... inherent in cryptography, which, in fact, can be 
defined as the science of transmitting and managing information in the presence of 
an adversary.” Koblitz and Menezes [362] discuss mistakes in “proofs of security” of 
various cryptosystems and why, despite this, we should not worry about the security 
of internet transactions. 


3.4.6 The Connection Between Theta Functions and Coding 
Theory 


References for this subject are Sloane [606, 607], Jessie MacWilliams and Sloane 
[445], and Conway et al. [107]. The codes discussed here are the error-correcting 
codes, not the secret ones mentioned in the last paragraph on elliptic curve 
cryptography. Much of the foundations of error-correcting codes was developed at 
AT&T Bell labs. We shall consider only binary codes. These are vector spaces over 
the field Fz with two elements. 

Consider a noisy telephone connection transmitting Os and 1s. Sometimes a 1 is 
transmitted but is received as a 0. Thus the messages need to be encoded to include 
redundancy in order to correct errors. The coding theorist corrects errors by sending 
messages encoded into codewords which are vectors of Os and 1s. The Hamming 
distance between two codewords x,y is the number of indices i in the vectors x = 
(x1, ---; Xn) and y = (y1, ..., Yn) such that x; Æ yj. 


Definition 3.4.2. A binary code C of type [n,k,d] is a set of 2* vectors u = 
(uy, ..., Un) with u; € Z/2Z = F3 for alli=1, ..., n such that both of the following 
hold: 


(a) C is closed under addition (componentwise and mod 2). 
(b) The Hamming distance between two codewords is > d. 


Coding theorists have the incompatible goals of seeking n to be small for speed 
of transmission, k to be large for efficiency, and d to be large to correct many errors. 
It turns out that a code [n,k,d] can correct | (d — 1)/2] errors. 

Suppose that C is a linear code of type [n,k,d]; i.e., C forms a k-dimensional 
vector subspace of F} with scalars given by elements of the finite field F2. Then it 
is shown in Sloane [606] that one can obtain a lattice in R” from C in several ways 
(see Sect. 1.4 for the definition of lattice and the connection with positive definite 
symmetric matrices). One way of obtaining a lattice from C (and the only one that 
we shall discuss) is to set 


L(C) = { (w+ 2m)/Vv2 ue Cmez"t. 
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Here we think of u as a vector of Os and 1s. 

For example, the code C = {000,011, 101, 110} of type [3,2,2] gives rise to the 
face-centered cubic lattice (see Sect. 1.4) in this way and thus to the densest lattice 
packing of spheres in R? (see Volume II which is [667]). 

If u € C, the weight of u = wt (u) is defined to be the number of components u; of 
u such that u; 4 0. And the weight enumerator of C is a homogeneous polynomial 
of degree n in two indeterminants x and y defined by 


welx,y) = X, myn) 
uec 
It has been proved by Berlekamp, Jessie MacWilliams, and Sloane as well as Broue 
and Enguehard (see Sloane [606]) that defining the theta function associated to 
L(C) by 
ao) = DY exp(miciaa), z€H, 
aEL(C) 

then Orc) is actually a weight enumerator with Jacobi theta functions from 
Exercise 3.4.12 substituted for x and y. The explicit formula is 


Orco (z) = we(63(0,q),02(0,4)) if q= e. 


In particular, it is possible to use a code called the extended Hamming code Hg 
to obtain the theta function mentioned earlier, which is associated to the Lie group 
Eg (see Exercise 3.4.9). To obtain the Hamming code Hg, one writes down a matrix 
called the parity check matrix: 


11111111 
00011110 
01100110 
10101010 


Then the code words are obtained by choosing components u1, u2,u3,u4 arbitrarily 
in Fy while choosing the remaining components of u so that the equation Hu = 0 
holds (modulo 2, of course). This is 4 equations in 4 unknowns, so that the solution 
is unique. And Hg is a code of type [8,4,4]. 


3.4.7 Poincaré Series 


There is another example of a modular form which has been useful—the Poincaré 
series, which generalizes the Eisenstein series Gg defined in formula (3.59) by 
replacing 


(cz+d)* = r for y= ( € SL(2,Z), 
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Tage", 


for some holomorphic bounded function f : H —> C. Usually, a function such as 
f(z) = exp(2mimz) for m € Z is chosen (see Lehner [410, p. 275] and Rankin [532, 
p. 136]). Petersson showed that you can obtain a basis for M (SL(2, Z), k) of such 
Poincaré series (see Lehner [410, p. 293] and Rankin [532, pp. 153—159]). 

These Poincaré series are also useful in the study of automorphic forms for 
congruence subgroups (see Rankin [532, pp. 172—193]). In his article on the 
basis problem for modular forms on To(N), Eichler uses the Poincaré series with 
fa(Y:z) = x(Y)(1(z) —w)7!, where (y) = x(a), for y a character of the group 
(Z/NZ)* = the multiplicative group of integers a mod N with (a,N) = 1 (see 
Eichler [151, p. 122]). These Poincaré series are used to construct a Green’s function 
to be used in the trace formula. The Fourier coefficients of Poincaré series involve 
interesting arithmetical sums called Kloosterman sums (see formula (3.97) and 
Rankin [532]). We should note that one of the big problems with Poincaré series is 
that they may be identically zero (see Lehner [410, p. 25, pp. 290-291]). 

Finally, we should mention that Poincaré series (like Eisenstein series) are 
capable of extensive generalization. For example, Svetlana Katok [342] considers 
relative Poincaré series attached to hyperbolic elements % € T, where F is a 
Fuchsian group of the first kind with compact fundamental domain. Equivalently, 
one can say that these series are attached to closed or periodic geodesics in the 
fundamental domain of I. She proves that these series generate the whole space of 
cusp forms of weight greater than or equal to 4. These series are sums over 'p\T, 
where To is the centralizer in T of 7; i.e., To is the set of elements of IT commuting 


with y. If 
_ (abo 
pa & a l 


Qp (z) = coz” + (do — ao)z — bo. 


set 


Then the relative Poincaré series for % is essentially the sum over y representing 
To\F of 


; b 
Ow (yz) */? (cz+d)™ where y= ( *) 
The construction works for groups with cusps also (see Exercise 3.4.17). Further- 
more, Katok shows that the periods over closed geodesics play as important a role 
as the Fourier coefficients play for the group SL(2,Z) and congruence subgroups. 
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Exercise 3.4.17 (from Svetlana Katok [342]) (Relative Poincaré Series for SL(2, 
Z)). Let % € SL(2,Z) be a hyperbolic element. With the notation above, consider 
the relative Poincaré series defined by: 

S On (yz) "7 (ez+d)*, where y= G ;) € SL(2,Z). 

Tot cd 


Show that the series represents a modular form of weight k for k > 4. 


3.4.8 Some Estimates for Fourier Coefficients of Modular 
Forms 


We have noted previously the Ramanujan conjecture bounding Fourier coefficients 
of certain cusp forms for SL(2,Z). Here we look at non-cusp forms and we prove a 
weaker result for cusp forms. 


Theorem 3.4.4 (Estimates for the Fourier Coefficients of Modular Forms). The 
Fourier coefficients of the Eisenstein series G;(z) defined by Eq. (3.59) 


Gi(z) = X, anexp(2ninz), (k= 4,6,8, 10, 12,...) 


n>0 


satisfy 
An! <a, < Bn“! for some positive constants A,B. 


The Fourier coefficients of a cusp form f(z) of weight k for SL(2,Z) with 


f2) = » anexp(2rinz) 


n>1 


satisfy the Ramanujan—Petersson bound (proved by Deligne [126, 127]): 
lan| < cnt te, forany €>0. 
Proof (of a weaker result). To prove the estimate for the coefficients of Gg, note that 
Exercise 3.4.2 implies that 
an = Cop (N) 
for some constant C Æ 0, where o; denotes the divisor function 


O;(n) = bY d, 


0<d|n 
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where the sum is over all positive divisors d of n. We can estimate o,(n), using the 
Riemann zeta function, as follows: 


co(n) = > d =n > d*<n'C(s), 


0<d|n 0<d|n 


ifs > 1. 

The estimate for cusp forms comes from Deligne’s proof of the Weil conjectures 
on zeta functions of n-dimensional projective nonsingular varieties over finite fields, 
as we mentioned earlier in our discussion of Ramanujan’s tau function. Deligne’s 
proof cannot be discussed here. Instead we give an exercise proving a weaker result. 
See Exercise 3.4.18. 


Exercise 3.4.18 (A Weak Estimate for the Fourier Coefficients of Cusp Forms). 
Show that if f(z) is a cusp form of weight k for SL(2,Z) as in Theorem 3.4.4, we 
have the inequality |a,| < Cn*/?, for some positive constant C. 


Hint. Since f(z) vanishes at infinity, we can factor exp(2miz) out of the Fourier 
expansion and obtain |f(z)| < Cexp(—2my). Since the function g(z) = |f(z)|y*/? 
is invariant under SL(2,Z) and approaches zero as y approaches infinity, it is 
bounded by some constant M. It follows that the Fourier coefficient day satisfies 
\dn| < My~*/? exp(2mny) for all y > 0. Take y = 1/n to complete the proof. 


The idea of Exercise 3.4.18 is Hecke’s. Classical improvements in the estimate 
were obtained by Rankin [534] and Selberg [570], among others. Note that the 
proof in Exercise 3.4.18 applies also to noncongruence subgroups and nonintegral 
weights, as well as to the nonholomorphic modular forms to be considered in the 
next section, while Deligne’s method does not apply to these other situations. 

There is an interesting question raised by Atkin and Swinnerton-Dyer. The 
Fourier coefficients for modular forms on congruence subgroups have “bounded 
denominators” in the sense that the forms are linear combinations of functions with 
coefficients that are integers in some algebraic number field (of finite degree over 
Q). This is not the case for noncongruence subgroups. 


Question. Do all of the modular forms for noncongruence subgroups have this 
“bad” property unless they are really modular forms on congruence subgroups? 


3.4.9 The Representation of Integers by Quadratic Forms 


Let P be a positive definite even integral matrix P of determinant one as in 
Theorem 3.4.3. Define the representation numbers rp(m) by 


rp(m)=#{a € Z” | Pla] =2m}. 
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It follows from what we know about modular forms of weight k for SL(2,Z) that 
there is a positive constant C such that for every € > 0, 


Ik k-1 
rp(m) — BO (m)| < Cm? ”, where k = n/2, k even integral. (3.79) 


Here B; denotes the kth Bernoulli number defined in Sect. 1.4. 


Exercise 3.4.19. Prove the estimate Eq. (3.79). 


Hint. From Theorem 3.4.3, you know that 0(P,z) E€ M(SL(2,Z),n/2). So there is 
a constant c; and a cusp form fg such that 


O(P, z) = ccGy(z) + f(z). 


Compare constant terms in the Fourier series for both sides of this equation to 
see that cy, = [2¢(k)]~!. This allows you to evaluate the mth Fourier coefficient of 
cxG;(z), using Exercise 3.4.2. Use Euler’s formula for the Riemann zeta function at 
even integers, which was mentioned in Sect. 1.4 and Theorem 3.4.4 (and proved in 
Exercise 3.5.7) to finish the exercise. 


It would be more interesting perhaps to obtain formulas for the representation 
numbers of P = I = the n x n identity matrix. Then one would have a formula for 
the number of ways of representing m as a sum of n squares. Such results for certain 
n are considered in Rademacher [525, Chap. 11], and Knopp [357, Chap. 5]. Siegel 
answers much more general questions than these in his work on quadratic forms 
mentioned earlier. 


Exercise 3.4.20. Show that if n = 8 and P is a positive definite 8 x 8 symmetric 
matrix satisfying the conditions of Theorem 3.4.3, then 


rp(m) = 24003(m). 


3.4.10 Korteweg—DeVries Equation 


The nonlinear partial differential equation often written in the form us + 6uu, + 
Uxxx = 0 was derived by Korteweg and DeVries in 1895 in a study of long water 
waves in a rectangular canal. If you replace u by —u, the equation becomes 
Ut — 6uuy + Uxxx = 0. Often the constant 6 is replaced by 12 as well. Of course, 
in physical problems the constants would reflect the properties of the materials 
involved. The KdV equation applies to many other situations (e.g., plasma physics) 
in which solitons occur. This means that waves show a particle-like behavior in 
interactions; that is, the interacting waves keep their shape and amplitude while 
undergoing a shift. Korteweg and DeVries found a solution involving the hyperbolic 
secant and one involving elliptic functions. See the book of Lonngren and Scott 
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[429] for various articles on solitons and the KdV equation (particularly the articles 
of Miura and Hermann). Another reference is Kasman [339]. The mandatory 
quotation on the subject is from J. Scott-Russell’s “Report on Waves” from 1844. I 
found it in Keener [348, p. 408]. 


‘I was observing the motion of a boat which was rapidly drawn along a narrow channel by 
a pair of horses, when the boat suddenly stopped—not so the mass of water in the channel 
which had put it in motion; it accumulated round the prow of the vessel in a state of violent 
agitation, then suddenly leaving it behind, rolled forward with great velocity, assuming the 
form of a large solitary elevation, a rounded, smooth and well-defined heap of water, which 
continued its course along the channel apparently without change of form or diminution of 
speed. I followed on horseback, and overtook it still rolling on at a rate of some eight or 
nine miles an hour, preserving its original figure some thirty feet long and a foot to a foot 
and a half in height. Its height gradually diminished, and after a chase of one or two miles 
I lost it in the windings of the channel. Such, in the month of August 1834, was my first 
chance interview with that singular and beautiful phenomenon. ... 


Exercise 3.4.21 (The Korteweg—DeVries Equation). Show that the Weier- 
strass g2-function which satisfies the nonlinear differential equation given in 
formula (3.63) also represents a travelling wave solution of the Korteweg—DeVries 
equation u, — 6uuy + Uxx» = 0 of the form u = w(x + ct). This gives after a couple 
of integrations the ODE: (w’ y = 2w? — cw? — gw — h, where g and h are constants 
of integration. This ODE now begins to resemble the differential equation for the 
Weierstrass g7-function. See Kasman [339] for more details. 


3.5 Modular Forms: Not So Classical—Maass Waveforms 


Il a fallu Maass pour nous sortir du ghetto des fonctions holomorphes.> 


—From Weil [729, Vol. IH, p. 463]. 


3.5.1 Maass Waveforms 


Our main goal for the remainder of this chapter is to develop harmonic analysis 
on SL(2,Z)\H. That is, we seek the spectral decomposition of the non-Euclidean 
Laplace operator A on H into SL(2,Z)-invariant eigenfunctions. This is a non- 
Euclidean analogue of Fourier series. However, because the quotient SL(2,Z)\H 
is not compact, there is a continuous spectrum as well as a discrete spectrum, so 
that one has a mixture of Fourier series and integrals. In this section, we shall study 
the eigenfunctions of A on SL(2,Z)\H. We will find that these eigenfunctions have 
much in common with the classical automorphic or modular forms considered in 


>We needed Maass to help us get out of the ghetto of holomorphic functions. 
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the preceding section. So we shall also call these eigenfunctions of A “modular 
forms.” But usually we will call them “Maass waveforms,” because they were first 
systematically considered by Maass [439] in 1949. We will find that the only 
known explicit example of a Maass waveform for SL(2,Z) is the Epstein zeta 
function from Sect. 1.4, which can be viewed as a nonholomorphic analogue of the 
Eisenstein series in Sect.3.4. However, analogues of cusp forms are also known 
to exist, but we cannot produce examples such as the discriminant function A in 
Sect. 3.4, except for congruence subgroups (since the product of eigenfunctions of 
the Poincaré Laplacian (which we also call A) will not usually be an eigenfunction 
of A). However, computer approximations of Maass cusp forms are plentiful. 

Some references for this section are Borel and Casselman [52], Borel and 
Mostow [54], Bump [71], Cartier and Hejhal [78], Delsarte [129, Tome II, pp. 599- 
601, 829-845], Elstrodt [155, 156], Faddeev [171], Fay [176], Gangolli [191], 
Gangolli and Warner [195, 196], Gelbart [200], Gelfand et al. [203], Goldfeld and 
Hundley [219], Goldfeld and Husemoller [220], Hejhal [261—263, 266], Huber 
(306, 307], Iwaniec [319], Kubota [375], Lachaud [382], Lang [389], Lax and 
Phillips [400], Maass [437,439], Moreno [477], Roelcke [544-546], Sarnak 
[555,556], Selberg [569-571,573], Venkov [693-695], Weil [728], and Zagier 
[750]. 


Definition 3.5.1. A function f : H — C is a Maass waveform (or nonholomor- 
phic modular or automorphic form) if it satisfies the following three conditions: 


(1) f is an eigenfunction of the non-Euclidean Laplacian; i.e., Af=A f, for some 
à €C, where A = y?(07/dx? +07/dy). 

(2) f is invariant under the modular group; i.e., f(yz) = f(z) forall y€ T = SL(2, Z) 
and all z € H. 

(3) f has at most polynomial growth at infinity; i.e., there are constants C > 0 and 
k such that | f(z)| < Cy*, as y > © uniformly in x. 


Here N'(SL(2,Z),A) will be our notation for the vector space of such Maass 
waveforms. Since Af = Af is an elliptic partial differential equation, the Maass 
waveform f is automatically real analytic (see Garabedian [197, Chap. 5]). There 
is a discussion in Maass’s lectures [437, Chap. 4] of a generalization of the Maass 
waveform which contains the classical holomorphic modular form from Sect. 3.3 
as well. A fundamental application of Maass waveforms for discrete subgroups 
T of SL(2,R) with compact quotients [\H was already noted by Delsarte in 
1942 [129, Tome II, pp. 599-601]. Then in the 1950s Selberg developed his trace 
formula, along with many number-theoretical applications, some of which were 
found independently by Eichler (see Selberg [569] and Eichler [152]). We shall 
discuss the trace formula in Sect. 3.7. 


® Actually, you must multiply Maass’s functions g in [SL(2,Z),s,s,1] by y’ in order to obtain our 
functions f in N (SL(2, Z),s(s—1)). 
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Example 3.5.1. (Eisenstein Series—Alias Epstein’s Zeta Function) As in 
Sect. 3.2, the easiest way to construct eigenfunctions of A is to build them up 
out of the power function 


ps(z) = (Imz) =y", ifz=x+iyed, (3.80) 


for clearly, Aps = s(s — 1)ps. Formulas (3.16) and (3.25), along with the exercises 
following those formulas, show that the K-Bessel and associated Legendre functions 
are built up out of power functions by integration over the appropriate subgroup of 
SL(2,Z). In the present situation we must sum rather than integrate, since SL(2, Z) 
is a discrete subgroup. The Eisenstein series E,(z) for s € C with Res > 1 and 
z € H, is defined by 


E,(z)= > ps(yz), (3.81) 
yeTo\P 
where 
ew su(2,2) =T} 3.82 
Di 0 +1 s ( , J= ` ( ) 


The notation in Eq.(3.81) means that the sum runs over a complete set of 
representatives y € T for the quotient T. \I. It is necessary to “mod out” T., because 


Ps(¥Z) = ps (z) for all yET..,z€H. 


Note that T is the stabilizer in I of the cusp at infinity (i.e., the subgroup of 
T fixing œ). When there are more cusps in the fundamental domain for the discrete 
group I, there will be more Eisenstein series (one for each T -inequivalent cusp). The 
convergence in formula (3.81) is absolute and uniform on compacta in the s-plane, 
by arguments given in Sect. 1.4 for Epstein’s zeta function, using the following 
exercise. 


Exercise 3.5.1 (The Connection Between Eisenstein Series and Epstein Zeta 
Functions). Let Y be in 


SP, = {Y € R”? | Y positive definite, |Y| = 1}. 


Define Epstein’s zeta function for Res > 1, Y € SP2, as in Sect. 1.4, by 


where Y [a] = ‘aYa, viewing a € Z? — 0 as a column vector. 
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Recall that, according to Exercise 3.1.9 of Sect. 3.1, we can identify H and SP2 
via z =œ W; where z = x + iy and W; € SP3 is defined by 


maa fo TC est): 


(a) Show that if a = ( ") € R2, then 
—m 


W. [a] = 'aW.a = y~! |mz+ nb.. 


(b) Show that if Res > 1, then the Eisenstein series (3.81) is given by 
y —2s 
E(zv=— } |mz+n\. 
(m,n)=1 
Here (m,n) = the greatest common divisor of m and n. 


(c) Prove that ¢(2s)E,(z) = Z(W-,s), if ¢ (s) = Riemann’s zeta function. 
(d) Show that Z(W,s) = Z(W~!,s) if W € SP2. 


Hints on part (b). You must show that the quotient T..\T has as representatives 
* Ok 
mn 
with (m,n) = 1. This is a consequence of the fact that 
+1 q ab\ [* * 
O Als hed J  \ cd. 


You also need to know that a row vector in Z? can be completed to a matrix in 
SL(2,Z) if and only if the gcd (=greatest common divisor) of its entries is one. 


Hints on part (c). In the definition of Z(W,s), the sum runs over all (m,n) € 
Z? —0. Factor the greatest common divisor out of m and n. 


Hints on part (d). If W is a2 x 2 positive matrix of determinant 1, then 


—d 
Wolo | —w . 
Haka 
From Exercise 3.5.1, we see that Es E€ N(SL(2,Z),s(s—1)) when Res > 1, 
provided that E,(z) has polynomial growth as y — ©. 
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Exercise 3.5.2. 


(a) Show that E; (yz) = Es (z) for all y € SL(2,Z) and all z € H, if Res > 1. 
(b) Show that E, (z) ~ y5, as y > ©, for s fixed with Res > 1. 
(c) Conclude that E, (z) € V(SL(2,Z),s(s—1)) when Res > 1. 


Hint on part (b). Use the formula from part (b) of Exercise 3.5.1. 


Theorem 1.4.1 of Sect. 1.4 shows that E;(z) has an analytic continuation to the 
entire complex s-plane as a meromorphic function of s. And E,(z) has a pole at s = 1 
with residue 3/7 = (vol(SL(2,Z)\H))~!. Moreover, the Eisenstein series satisfies 
the functional equation’ 

A(s)Es(z) =A(1—s)E\-s(z) if A(s) = m “T(s)€(2s). (3.83) 
There must be a functional equation relating E; and E;_, because both functions lie 
in N(SL(2,Z),s(s—1)), which will be proved to be one-dimensional for Res 4 5. 

Since Es(z) = Z(W;,s)/¢ (2s), any zero of ¢(2s) will produce a pole of E;(z) 
unless Z(W,,s) vanishes at that value of s. The trivial zeros of Z(W-,s) and ¢ (2s) 
are both s = —1, —2, —3, ... ; and both have order 1. Thus the trivial zeros cancel 
out in E,(z). The Euler product for the Riemann zeta function in formula (1.21) 
and Exercise 3.5.8 show that ¢ (2s) Æ 0 for Res > 5. This is true in a larger region, 
but the Riemann hypothesis that the nontrivial zeros of €(2s) must lie on the line 
Res = 1 remains unproved after more than 100 years. One of the motivations for the 
study of harmonic analysis on ’\H is the desire to understand the complex zeros of 
€(s) (see Hejhal [262, p. 479] or [263], Cartier and Hejhal [78], or Zagier [750, 
p. 276]). Note that the zeros of Z(W,s) depend on W and, in general, do not satisfy 
the Riemann hypothesis (see Stark [629] and Titchmarsh [680, p. 244]). They tend, 
however, to lie on the line Res = 5. but can also be found in Res > 1. 

From Exercise 3.5.2 and the functional equation (3.83) we see that 

A(s)E,(z) ~ A(s) +A (1 — s)y!™ as y > o. 
In fact, it is not hard to obtain the complete Fourier expansion of the Eisenstein series 
(see Exercises 3.5.3 and 3.5.4). The result of Exercise 3.5.4 is a very useful one in 
number theory for it reveals much about the Eisenstein series. It is thus reminiscent 
of the incomplete gamma expansion of Epstein’s zeta function (Theorem 3.4.1 of 


Sect. 1.4). This might lead one to say that these formulas are the meat and potatoes 
(or quiche & salad, or tofu & rice) of the subject. 


Exercise 3.5.3 (Fourier Expansions of Maass Waveforms). Show that a modular 
form f € N(SL(2,Z),s(s— 1)) has a Fourier series expansion as a periodic function 
of x given by 


fe) =a tby +E Any/YK,_ 1 (22|n|y)exp(27inx), 
n#0 
where K,(y) is the K-Bessel function in Sect. 3.2. 


TWe shall use the letter T for discrete groups such as SL(2,Z) as well as for the gamma function. 
Hopefully the meaning will be clear from the context. 
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Hints. Use separation of variables in Af = s(s—1)f, as in Exercise 3.5.4 of 
Sect.3.2, and rule out the second solution /;(y) of the second-order ordinary 
differential equation for K;(y), by the polynomial growth of f(z) as y > © (see 
Lebedev [401, p. 123]). 


Exercise 3.5.4 (The Fourier Expansion of the Eisenstein Series). Set A(s) = 
m “I(s)€(2s) and E*(z) = A(s)Es(z). Show that Ef (z) has the Fourier expansion 


E*(z) = y*A(s) +y!A(1 —s)+2 £ Ia! -z(n)y PK,- 12 27]aly)e 
n#0 


Here o;(n) is the divisor function defined by 


ee st 


0<d|n 
where the sum runs over all positive divisors d of n. 


Hints. 
Method 1 (Chowla and Selberg [88]). Start with the Mellin transform result in 
Exercise 1.4.7 of Sect. 1.4: 


17 1 
d=5 f cuts Z en(- mwg |) qr (3.84) 


Now, using the formula for W; in Exercise 3.5.1, we have 


W: H =y! (a— xb}? + yb’. 


b 
Thus the b = 0 term of the sum in Eq. (3.84) is integrated to obtain A(s)y°. When 
b + 0, the variable a is summed over all of Z, and thus one can apply the Poisson 
sum formula and obtain, as in Exercise 3.4.11 of Sect. 3.4 


ey exp [=z (a— xb) |= i £ exp |-x} a’ +2riabx (3.85) 


acZ acZ 


After pulling out the b = 0 term, substitute Eq. (3.85) into Eq. (3.84). Then use the 
integral formula for K,(z) in Exercise 3.2.1 of Sect. 3.2 to complete the proof. 
Method 2 (Bateman and Grosswald [30] or Terras [665]). Apply the Poisson 
summation formula directly to the series defining Z(W,s), after taking out the b = 0 
term of the sum over (a,b) € Z? — 0, as in Method 1. 

Method 3 (Kubota [375, pp. 13-17]). Note that 


E,(z) =y +y £ [lez +d|7” — pty £ cs 5 lz+d/c|-*. 
gced(c,d)=1 col gcd(c,d)=1 
cl 
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Let d=r+cn, 0<r<c,n€Z. Use Poisson summation on the sum over n to 
complete the proof. You will also need the identity relating the divisor function and 
the singular series (see Hardy [249, p. 141]): 


C(2s) $, c” exp(2mimd/c) = o1~25(m). 
c>0,d mod c 
gced(d,c)=1 


The name “singular series” was used by Hardy and Littlewood in their work on 
Waring’s problem (see Hardy [250, Vol. I, pp. 377—532]). It was applied to Fourier 
coefficients of Eisenstein series by Siegel [600, Vol. 1, p. 329] and Maass [438, 
pp. 300-313]. This third method is related to the Bruhat decomposition of SL(2,Q) 
(see Volume IT i.e., [667]). 

The singular series form of the Fourier coefficients of Es(z) in Method 3 of the 
preceding exercise involves Ramanujan sums: 


c-(n)= X, exp(2aimn/r). 


1l<m<r, 
gcd(m,r)=1 


Ramanujan studied various formal trigonometric series involving these sums (in- 
cluding that obtained from E;(z)). Kac [330, pp. 86-96] showed that such series 
can often be viewed as Fourier expansions of almost periodic functions. See also 
Delsarte [129, Vol. II, pp. 603-624]. 


Exercise 3.5.5. Show that at s = 1/2 the poles of the constant term in the Fourier 
expansion of the Eisenstein series E¥(z) in Exercise 3.5.4 cancel. Evaluate the 
resulting constant in terms of known constants like x, y =Euler’s constant, etc. That 
is, evaluate 


lim (A(s)y* +y' SA(1—s)), where A(s) =a °T(s)C(2s). 


Sx 


Exercise 3.5.6 (Kronecker’s Limit Formula). Use Exercise 3.5.4 to show that if 
W, is as in Exercise 3.5.1 


sl 


im {204.5 afr log? —log( y5|n œ+ 5) } 


where y = Euler’s constant defined by 


24 
y= lim (3 Z-ig = 0.577215..., 
m 


n—oo 
m=1 


and n (z) = Dedekind’s eta function from Sect. 3.4. 
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Hint. Note that K, (y) = (x/2y)2e Ə and use the definition of eta as an infinite 


product. 


The result of Exercise 3.5.6 goes back to Kronecker [374, Vol. 4, pp. 222, 347- 
495, Vol. 5, pp. 1-132], along with a second limit formula for the function 


> Wla| °exp(2ai'qa) forW € SP2, q € R°. 
acZ2-0 


Interesting discussions of this result can be found in Siegel [599], Weber [724, 
Vol. 3], and Weil [727]. The Kronecker limit formulas are of central importance for 
the construction of class fields of algebraic number fields, as Kronecker had already 
demonstrated in 1863, by proving that the limit formula gives a solution (x,y) € Z? 
of Pell’s equation 


x? —dy’ =+1 (ifd is a given positive integer) 


in terms of modular functions eta and theta. Some related references are Goldstein 
[224], Hecke [258, pp. 198-207, 290-312], Katayama [340], Lang [390], Meyer 
[466], Ramachandra [529], Shintani [594], and Zagier [751]. Stark [626, part 4] 
uses Kronecker’s limit formula to prove Stark’s conjectures on values of L-functions 
when the base field is Q or an imaginary quadratic field. This gives explicit 
reciprocity laws in some abelian extensions of the base field. Stark [630] and Gupta 
[237] use the Kronecker limit formula to obtain an analytic proof of the Coates— 
Wiles theorem on L-functions for elliptic curves with complex multiplication. 

There are other sorts of applications of the Fourier expansion in Exercise 3.5.4. 
Stark [625] uses generalizations of that result to show that there are exactly nine 
imaginary quadratic fields with class number 1. The Fourier expansion can also be 
used to obtain comparisons of values of the Riemann zeta function at even and odd 
positive integers 2n and 2n + 1 in terms of rapidly converging series of exponentials 
(see Terras [666] and Exercise 3.5.7). One has Euler’s formula for €(2n), n = 
1,2,3,..., but there is no simple interpretation for €(2n + 1), n = 1,2,3, .... There 
are, however, conjectures of Lichtenbaum relating these odd values to orders of K- 
groups (see Lichtenbaum [420]). In Volume II [667] we will find that ¢(3) does 
appear in the volume of the fundamental domain for SL(3, Z). This fact was already 
noted by Siegel, but the Gauss—Bonnet theorem does not apply in this case to relate 
¢(3) to a rational number times some power of m (see Weil [729, Vol. I, p. 561]). 
Apéry showed that ¢ (3) is indeed irrational (see Van der Poorten [690)]). 


Exercise 3.5.7. 


(a) Prove Euler’s formula: 


6 (2n) =~ 2n)" Ban, n= 1,2,3,.... 
Nn): 
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Fig. 3.37 Contour for the evaluation of ¢ (2n), n € Z+ 


Here B,,= the nth Bernoulli number defined by 


x Bn 
ex—] = 2 a 
n>0 °°" 
(b) Use Exercise 3.5.4 to show that 
2 3 —2nn 22 
C(3) = Ww — 4 Y eo (n) | 2n7n? + an+ 
45 il 2 


(c) Show that 


1 
lim (<¢ (s) ) = y= Euler’s constant. 


~ #21 


Hint on Part (a). Look at the contour Cp in Fig. 3.37 and let R > œ in 


[ (e&—1) 'z "dz, 
Cr 


using the Cauchy integral formula and residue calculus. 
Hint on Part (b). See Terras [666]. 
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Exercise 3.5.8. Show that ¢(1 + it) 40 for allt in R. 


Hints. (See Jacquet and Shalika [324] and Zagier [750].) Suppose that 
C(1+it)=0. Then E(j4i)/2(z) is of rapid decay as y — œ, according to 
Exercise 3.5.4. But then if s = (1 + it)/2, we have 


1 
C(Es,y) = f Es(x+ iy) dx 
x=0 
= (the constant term in the Fourier expansion of E,) = 0. 


Consider 
I(Es,r) = E C( (Es,y)y F ?dy 
=Í E,(z)y?dx dy = = E,(z)E,(z)y~* dx dy. 
T..\H 


Here I’... is as in Eq. (3.82). Let r = F: Then /(E,,5) is the square of the L?-norm of 
E, and we have a contradiction. 


The result in Exercise 3.5.8 is quite old and is fundamental for proofs of the 
prime number theorem using Tauberian theorems (see Wiener [736, pp. 112—121]). 
As Selberg [571] noticed, the preceding exercise is easy once one has the analytic 
continuation of E,(z) to Re s > 5 with the only pole occurring at s = 1, for one 
need only examine a nonconstant Fourier coefficient of E,(s) [which has ¢ (2s) in 
the denominator] to see that ¢(2s) cannot vanish on Re s = 4. The usual proofs of 
Exercise 3.5.8 are quite different (see Grosswald [233, p. 131]). 

Related Fourier expansions of Eisenstein series have been used by Kubota [376] 
as well as by Heath-Brown and Patterson [257] to study cubic Gauss sums. One 
can prove Gauss’s conjecture that the average order of the number of classes of 
positive integral binary quadratic forms of discriminant —D is 2nD!/?/(7¢(3)) by 
eoun class numbers as Fourier coefficients of holomorphic Eisenstein series 
of weight 3 (see Hecke [258, pp. 499-504]). Goldfeld et al. [218] use similar ideas 
to study thie arithmetic of elliptic curves. 


Exercise 3.5.9. 


(a) Let BC” (T\H) denote the set of bounded C” functions on F\H. Show that if A 
denotes the Poincaré Laplacian and f, Af both lie in BC” (T\H) then (assuming 
that f is real-valued) we have: 


ann=- f (GAGE Jasso 


T\H 
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where the inner product is 


(F.2)= | POD Pasay. 


(b) Deduce that if A is an eigenvalue of A on L? (T\H), A = s(s— 1), then Res = 5 
or s € [0,1]. 

(c) Suppose that f,g € BC” (T\H) are real-valued functions such that Af and Ag 
are also in BC” (T\H). Then A is symmetric; i.e., 


(Af, 8) = (g, Af). 


Hint. (See Lang [389, pp. 281-284]) Use Green’s theorem on a truncated 
fundamental domain as in Fig. 3.38. 


Using the notation of Exercise 3.5.9, let Da be the space of functions f in 
BC*(T\H) such that Af is also in BC*(I'\H). Then D4 is a dense subspace of 
L? (T\H) and A can be extended to a self-adjoint operator (see Lang [389, pp. 284— 
287]). This is done using the resolvent (A — s(s— 1))~!. Exercise 3.5.9 shows that 
the operator is negative. 


3.5.2 Maass Cusp Forms 


There are (sometimes) other members of N (T, 2) besides Eisenstein series. Recall- 
ing what happened in the last section for holomorphic modular forms, we expect 
to see cusp forms. We will have existence theorems and computer approximations 
but sadly so far none of the many explicit examples like A from the last section. 
The only explicit examples of Maass cusp forms come from Hecke L-functions with 
grossencharacters and they are for congruence subgroups not the full modular group 
(see Maass [439] and Hejhal and Str6mbergsson [275] who call them CM Maass 
forms). 


Definition 3.5.2. We say that f € N (T,A) is a Maass cusp form for T = SL(2,Z) 


if the constant term in the Fourier expansion in Exercise 3.5.3 vanishes; i.e., 


1 
J Fat inde =0=ay’ + by! for all y>0. 
x=0 

Let SN (T, A) denote the vector space of Maass cusp forms in (I, /). 


Exercise 3.5.10. Show that if f is a cusp form then f € L?(T\H), using the 
invariant area element y~7dx dy. 
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Hint. From the Fourier series for f and the asymptotic behavior of Ks(y) as y + ©, 
you can show that a cusp form must be bounded and thus is square-integrable since 
the fundamental domain has finite area (see Sect. 3.3). The Fourier series for f 
converges uniformly on the fundamental domain by the Weierstrass M-test, once 
you have a bound on the Fourier coefficients (see Theorem 3.5.2). 


The Eisenstein series E, is not a cusp form. 


Exercise 3.5.11. Show that if A(W,s) = 2 “I (s)Z(W,s), for W € SP2, s £0,1, 
then 


A(W,s) € L'(\H,y 7dxdy) if 0<Res<1 (and not otherwise); 
A(W,s) ¢ L’(\H,y *dxdy) forall s. 


Hint. As in Exercise 3.5.10, make use of the Fourier series for A(W,s) in 
Exercise 3.5.4. 


Now we want to study the structure of the space of Maass forms NV (SL(2, Z), AÀ) 
more closely, recalling what happened for classical modular forms in Sect. 3.4. 
Given f in M(SL(2,Z),k) = the classical holomorphic modular forms of weight 
k, there is always a constant c such that f — cG, is a cusp form, where G} = the 
holomorphic Eisenstein series. This is not so clear in the nonholomorphic case, 
since the Fourier expansion of f in NV (SL(2,Z),s(s—1)) begins with ay’ + by!-S, 
rather than with a constant. Suppose that Re s > 1, s ¢ (0, 1]. Then we can find a 
constant c such that (f — cE,) has constant term by =s. This implies that (f — cE;) is 
square-integrable over the fundamental domain, using the non-Euclidean invariant 
area element y~*dx dy. But this contradicts the fact that the Laplace operator is 
negative on the fundamental domain (see Exercise 3.5.9). We have thus proved part 
(a) of the following theorem. 


Theorem 3.5.1. 
(a) IfRes > 5, ands ¢ [3,1], then 


N (SL(2,Z),s(s—1)) = CE,. 


(b) N(SL(2,Z),0) =C. 
(c) IfRes = 4 ors € [5,1), then 


N(SL(2,Z),s(s— 1)) = CE, 8SN(SL(2,Z),5(s—1)). 
Note. Later we shall prove that if s € [5,1), then SN(SL(2,Z),s(s—1)) = {0} 
(see Theorem 3.5.3). 


Proof. These results would be easy if the constant term in the Fourier expansion of 
a Maass waveform did not have two parts, for you could easily subtract a multiple 
of the Eisenstein series to produce a cusp form. And a harmonic cusp form (i.e., a 
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Fig. 3.38 Fundamental domain truncated at y = Y 


cusp form f such that Af = 0) would have to be identically zero by the maximum 
principle for harmonic functions (see Garabedian [197]). 

In order to circumvent this difficulty, we must prove the one-dimensionality 
of the vector space V consisting of constant terms in the Fourier expansion of 
f € N(SL(2,Z),A). One proof of this fact uses Green’s theorem on the truncated 
fundamental domain 


by={zeH 


1 1 
>1,R —-=,-|,I <Y 
jl> 1, Reze [3,5] ame } 


pictured in Fig. 3.38. Suppose that f € V(SL(2,Z),A) and g € N (SL(2, Z), u) with 
Fourier expansions 


f(z)= > Am(y) exp(2zimx), g(z) = £ bm(y)exp(2rimx). 


mEZ mEZ 


Then the Euclidean version of Green’s theorem on the region Dy says that if d /ðn 
is the normal derivative on the boundary ð Dy, and ds the differential of arc length, 
we have 


Joy (Ag — gf) idx dy = Jany (s£ — 23t) ds 
+1/2 
= 5 (an (Y)b,, (Y )e2zi(ntm)x _ ap (Y Jbm (Y Jerie) dx 
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Here we have used the fact that f and g are invariant under SL(2,Z) to see that the 
integral over the boundary d Dy reduces to the integral over the top horizontal line 
Im z = Y, since the SL(2,Z) identifications make the top the only real boundary of 
the manifold Dy. The last formula comes from the orthogonality of the exponentials 
{exp(2zinx), n € Z}. 

The explicit expressions for the Fourier coefficients of f and g from Exer- 
cise 3.5.3 imply that if m #0 and A = u = s(s — 1), then 


O = amb” m — dpb-m. 
Therefore, A = u = s(s — 1) implies that 0 = aob'ọ — a'ọbo. Moreover, we know that 
ao(y) =ay’+cy'* and bo(y) = by +dy'™. 


When s 4 E it follows that ad — bc = 0, which means that the dimension of the 


vector space V of constant terms is indeed 1. The dimension is obviously 1 in the 


case that s = 5: 


Note. You can think of the constants in part (b) of Theorem 3.5.1 as residues of the 
Eisenstein series at s = 1. 


The following exercise will be useful in the last section of this chapter when 
we evaluate the parabolic term of the Selberg trace formula. It gives an idea of the 
independence of the two Eisenstein series as distributions. 


Exercise 3.5.12. 
(a) Show that if p(s) = A(1 —s)/A(s), where A(s) = 2 “T(s)€(2s), then 


(sA gA (s-s) 
(s-s) (1 -s -—s) (s-s) (1—5 —s) 


1 
[Bes —dxdy = 
Da 4 


At o(s)(1=s—s) Apl) (+51) 
(s—s')(1—s'—s) (s—s’) (1 


_ Asts’-1 A!-s-5'9(s)Q(s') | 


sts’-1 sts’-1 


Aes!) A’-5Q9(s) 


s—s! s—s! 


+o(1), as Ao, 


Here D4 is the truncated fundamental domain in Fig.3.38 and E; denotes 
the Eisenstein series, as usual. The notation “little-o of 1”, o(1), stands for a 
function which approaches 0, as A —> œ. 

(b) Let s’ = 5, s =o + ir, in part (a) and take the limits as © 4 5 to show that the 
term in braces from part (a) approaches 
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1 1 
5 {loss ? (; -ir) ? (; ir) | 
2 @ \2 @ \2 


Hint. See Kubota [375]. Start with J (GE)Es -E (AEs )) +dxdy. Use 
Da 


Green’s theorem. 


Theorem 3.5.1 is a special case of a result of Maass [439, pp. 169-170] (see also 
Maass [437, pp. 195-—215]). Formulas similar to those in Exercise 3.5.12 have been 
called the Maass-Selberg relations by various authors (see Kubota [375, pp. 18- 
20], Harish-Chandra [252, p. 75], Hejhal [261, Vol. II], Langlands [393, p. 333)). 
However, the Maass—Selberg relations involve truncating the Eisenstein series rather 
than the fundamental domain. See the website of Paul Garrett for many pages on the 
subject: 


http://www.math.umn.edu/~garrett/ 


He says: “The (false) assumption that truncated Eisenstein series are 


essentially eigenfunctions does permit a plausible looking and traditional (!) 
derivation of the correct inner product of two truncated Eisenstein series... but, 
of course, also of any other assertion, true or false. This outcome can be and has 
been (erroneously) construed as validating the (incorrect) argument that truncated 
Eisenstein series are effectively eigenfunctions.” 

The Maass-Selberg relations will be used in Sect. 3.7 to evaluate the parabolic 
terms in Selberg’s trace formula. These relations are also used in many of the 
references mentioned above to obtain the analytic continuation of the Eisenstein 
series for very general discrete groups, a result that was quite easy for SL(2,Z) 
(see Theorem 1.4.1 of Sect. 1.4). It is also possible to give inner-product formulas 
for truncated Eisenstein series and then no o(1) term appears (see Selberg [571, 
pp. 183—-184]). Selberg [574, Vol. I, p. 673] contains some interesting commentary 
on the Maass—Selberg relations, which draws a distinction between the result of 
Maass and that of Selberg. 


3.5.3 Computations of Maass Cusp Forms 


Now, what can be said of the spaces of Maass cusp forms for SL(2,Z)? We know 
from Exercise 3.5.9 and Theorem 3.5.1 that this space of cusp forms is {0} unless s 
lies in the interval [0,1] or the real part of s is A Furthermore, Theorem 3.5.3 will 
show that, in fact, the space is also {0} when s € (0,1). We shall show in Sect. 3.7 
that SN (SL(2,Z),s(s — 1)) # {0} for an infinite number of values s. However, no 
one has ever produced an exact value of s for which the space of cusp forms is 
nonzero—much less an example of a cusp form for the modular group. 
Nevertheless, it is possible to obtain computer approximations of cusp forms. 
Note that a cusp form vanishes like a constant times exp(—2zy) as y approaches 
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infinity. Thus, it is not too far wrong to consider the problem of finding eigen- 
functions of the non-Euclidean Laplacian on the compact region Dy in Fig. 3.38. 
The boundary conditions would be the periodicity coming from f(z+1) = f(z) 
and f(—1/z) = f(z), plus the vanishing of f(z) on the horizontal line y = Y. 
Note, further, that the symmetry u(x + iy) = —x + iy leaves the domain Dy and the 
Laplacian invariant. Thus the symmetry splits the space of solutions to our boundary 
value problem into even and odd functions of Rez. The space of odd functions will 
satisfy the Dirichlet problem, requiring them to vanish on the boundary. The space 
of even functions will satisfy the Neumann problem requiring that their normal 
derivatives vanish on the boundary. 


Exercise 3.5.13. Prove the last statements about even and odd eigenfunctions of A 
on the fundamental domain for SL(2, Z). 


Computations of the eigenvalues of the non-Euclidean Laplacian for the Dirichlet 
and Neumann problems have been done since at least the 1970s. The earliest 
computations were made by Cartier [77], Cartier and Hejhal [78], Haas [244], 
Hejhal [263,266,273], Hejhal and Berg using a CRAY-I supercomputer (see Hejhal 
[261, Appendix C to Vol. II]), Hejhal and Rackner [274], and Stark [631]. The 
most recent computations were made by Stromberg [644], Booker et al. [48], and 
Then [677]. The last reference finds Maass cusp form for eigenvalues bigger than 
1.6 x 10°. Now you can find lists of eigenvalues and Fourier coefficients of the 
corresponding eigenforms on various websites such as LMFDB, the database of 
L-functions, modular forms, and related objects 


http://www.lmfdb.org/ModularForm 
or 
http://modform.org/ModularForms/MaassForms. 


There are also density plots on various websites; e.g., those of Hejhal, Stromberg, 
and Strombergsson. 

We give some older eigenvalue tables here. The first table is that of Hejhal and 
Berg. See Table 3.5. This table has a shorter list of eigenvalues for cusp forms 
corresponding to the Neumann problem because the numerical method that was 
used produces “spurious eigenvalues” for the Neumann problem. These spurious 
eigenvalues would fill in the blanks for Neumann eigenvalues, except that they 
correspond to spurious cusp forms with a logarithmic singularity at the point 
exp(27i/3) in the fundamental domain. Hejhal [263] proves this and shows that 
the s corresponding to the spurious eigenvalue À = s(s — 1) are exactly the zeros of 
the Dedekind zeta function of the algebraic number field Q(exp(27i/3)). 

See Sect. 1.4 for a discussion of the Dedekind zeta function. The particular one 
that arises in Hejhal [263] is actually also an Epstein zeta function as well as the 
product of ¢ (s) and L(s,(—3/*)), formed with the Kronecker symbol (—3/x). See 
the discussion after formula (3.74) of Sect.3.3 for references on the Kronecker 
symbol. The L-function appearing here is defined by 
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Table 3.5 Hejhal—Berg table 
of eigenvalues À of A on 
H/SL(2,Z), for 
A=s(s—1), s= 5+it 
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t for odd cusp forms 


Dirichlet problem 


9.53369 52613 536 
12.17300 83246 797 
14.35850 95182 59 
16.13807 31715 23 


t for even cusp forms 


Neuman problem 


13.77975 13518 907 
17.73856 33811 
19.42348 147 
21.31579 6 


16.64425 92018 8 
18.18091 78346 
19.48471 38 
20.10669 4 
21.47905 7 
22.19467 


24.41971 incomplete 
25.05085 below 
26.0568 double line 
26.4469 

27.28 


The last digit in each number is uncertain. From 
Hejhal [261, Vol. II, pp. 653, 730] 


Table 3.6 Zeros s = 5 +iy y for €(s) 


of C(s)L(s, (—3/+)) y for L(s,(—3/*)) 


14.134725 8.039737 

21.022040 11.249206 
25.010858 15.704619 
30.424876 18.261997 
32.935062 20.455771 


L(s,(—3/*)) = > (—3/n)n™* for Res >1. 


n>1 


This is a very special example of a class of L-functions associated to Dirichlet 
characters. For the analytic continuation and functional equations of such L- 
functions, see for example Davenport [120]. The general theory is developed in 
Lang [388]. One can find tables of zeros of Dirichlet L-functions (see Spira [613]) 
and of Riemann’s zeta function (see Haselgrove and Miller [256] and Brent [65]). 
Or you can look at the L-functions websites mentioned above. We list a few of these 
zeros in Table 3.6. 

Table 3.5 should be compared with Table 3.8 which was made by Cartier [77] 
and lists eigenvalues for the non-Euclidean Laplacian on a rectangle. It is also 
interesting to compare Table 3.5 with Table 3.7 of Haas. The main difference 
between the Haas table and that of Hejhal and Berg is the absence in the latter 
table of the Neumann eigenvalues corresponding to the zeros of the Dedekind zeta 
function just mentioned. The spurious eigenvalues in the Haas table come from 
the numerical method used by Haas as well as Hejhal and Berg. All start with the 
Fourier expansion of an even cusp form: 
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Table 3.7 The Haas table of eigenvalues A of A on H/SL(2,Z), 


A =s(s—1), s= 5 + it as corrected by Hejhal to contain 18.261997 
(which had been omitted by mistake) 


t for Dirichlet’s problem t for Neumann’s problem 
9.533695 8.039738 
12.17301 11.24921 
14.35851 13.77975 
16.13807 14.13473 
16.64426 15.70462 
18.18092 17.73856 
19.48471 18.261997 
20.10669 19.42348 
20.45578 


This table includes spurious eigenvalues in the Neumann list as 
is explained in the text. The spurious eigenvalues correspond to 
numbers in Table 3.6 


u(z)= >, cny!’ Ki (2nny) cos(2mnx) 


n>0 


and seek to find the unknown Fourier coefficients c, and the number t, by choosing 
N point z;,...,zy in H and solving 


u(zj) — u(z;) : 
— 0, fH, CMe 

lzi—z >] 
The condition can be rewritten as a system of N linear equations in the N unknowns 
Ci, provided that one forgets the terms after cy in the Fourier expansion of the cusp 
form u. Thus ¢t must satisfy 


det(l;(zj,t))ı<ijcn =O for J;(z,t) involving K-Bessel functions. 


Hejhal proves in the above-mentioned paper that the eigenvalues which appear 
on the Haas list and not on the Hejhal list (the zeros of the Dedekind zeta function of 
Q(exp(27i/3)) are spurious because they correspond to eigenfunctions that behave 
exactly like cusp forms except that they have a logarithmic singularity at the point 
exp(27i/3) in the fundamental domain. So the Fourier expansion that is produced 
by this numerical method will converge only for y > v 3/2 and not for y < J 3/2. 
One can draw a very interesting moral on the care required in using computers 
(especially when dealing with Neumann boundary value problems with continuous 
spectra). 
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Table 3.8 Cartier’s table of eigenvalues A = s(s—1), s = 5 + it, 
for A on the rectangle R = {x+iy|0<x<5,1<y<5} 


t for Dirichlet’s problem t for Neumann’s problem 
9.790 8.906 
12.421 11.631 
14.590 13.863 
16.522 15.837 
17.019 16.108 
18.291 17.644 
19.948 19.330 
20.309 19.465 


Hejhal also notes that the Fourier coefficients c, in the expansion 


f(z) = X, cnexp (2rinx)y! Ky (2n|n|y) 
n#0 


for a spurious eigenform f(z) with cı = 1 do not satisfy the inequality 
len| < oo(n)n3/?, (3.86) 


where co(n) is the number of positive divisors of n. However, as we shall see 
in Theorem 3.6.4 of Sect.3.6, we can assume that the cusp form f(z) is an 
eigenfunction for all the Hecke operators and we can normalize f(z) so that the first 
Fourier coefficient cı = 1. For such forms, the inequality (3.86) has been proved (see 
Moreno [477] and Selberg [570]). This gives another indication that the spurious 
cusp forms are suspect. 

The inequality (3.86) is a weak form of the Ramanujan—Petersson conjecture 
for cuspidal Maass waveforms which are normalized eigenfunctions of the 
Hecke operators (to be defined in Sect. 3.6). This conjecture says 


|cp| <2 for all primes p. (3.87) 


Such a result is not contained in Deligne’s theorem stated in Theorem 3.4.4 of 
Sect. 3.4. Better bounds than that given in Eq. (3.86) have been obtained, but 
Eq. (3.87) is still unproved as we write this (see Marie-France Vignéras [703] for 
a discussion of the improvement of the exponent a in Eq. (3.86) to i as well as 
a discussion of the connections with group representations). It is now known that 
we can replace 3/10 by 7/64. See Kim [351]. Tables 3.9 and 3.10 provide a small 
amount of numerical evidence for the Ramanujan—Petersson conjecture (3.87), by 
listing the first few Fourier coefficients corresponding to various Maass waveforms. 
Check the extensive tables on the website of Strombergsson for more evidence. 

There are now lots of tables of Fourier coefficients of Maass cusp forms available 
on the web. Booker et al. [48] say: “We have computed the first ten eigenvalues on 
PSL(2,Z)\H, namely, 
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Table 3.9 The Hejhal—Berg table of Fourier coefficients of cusp forms for the eigenvalue A 
of A on H/SL(2,Z), A = s(s— 1),s = 5 + it (cn denoting the nth Fourier coefficient) 


b for even 

cusp forms C2 C3 C5 c7 
13.77975 13518 907 1.54930 447794 0.24689 977245 0.73706 04 —0.2614 
17.73856 33811 —0.76545 80566 —0.97777 89075 —1.01527 35 1.1807 
19.42348 147 —0.69276 198 1.56235 43 —0.03841 2 0.313 
21.31579 6 1.28752 9 1.25177 3 1.170 —0.54 

t for odd 

cusp forms (op) C3 C5 C7 
9.53369 52613 536 — 1.06833 35512  —0.456197 354 —0.29067 256 —0.7449 
12.17300 83246 797 0.28925 18714 —1.20185 8761 0.03955 272 0.4481 
14.35850 95182 59 —0.23091 51912 0.69559 49863 —1.29828 45 —0.4834 
16.13807 31715 23 1.16185 5592 —1.28197 2561 —0.75680 63 —0.2985 
16.64425 92018 8 —1.54022 7825 0.97749 2591 —0.10524 2 —0.693 
18.18091 78346 0.37406 3346 0.10195 8698 0.63733 1 —1.542 
19.48471 38 —1.70018 80 —0.61456 54 0.8198 0.063 
20.10669 4 0.85884 4 0.18727 7 —1.395 0.78 
21.47905 7 —0.65625 1 0.22644 2 1.802 0.42 
22.19467 1.59685 —1.11648 —0.637 —1.00 


The last digit in each number is doubtful. From Hejhal [261, Vol. II, pp. 653, 730] 


r & 9.5337, 12.1730, 13.7798, 14.3585, 16.1381, 
16.6443, 17.7386, 18.1809, 19.4235, 19.4847, 


to a precision of more than 1,000 decimal digits, together with the first 455 Fourier 
coefficients a1,...,a455 to 900 digits (at least the first 50 of these were actually 
obtained to more than 1,000 digits)” They also note that “The computer time 
required was between one and three weeks per example ... (on a 1.5GHz PC).” 
Here we still give the old tables of Hejhal and Berg (see Table 3.9) and Stark (see 
Table 3.10). Hejhal used a supercomputer while Stark used the UCSD VAX. 

The numerical methods used by Hejhal and most of the others are limited by the 
decreasing size of the K-Bessel functions in the Fourier expansions of cusp forms. 
Accurate evaluations of the K-Bessel functions are essential. Riho Terras [675] did 
this for the papers I wrote involving K-Bessel expansions in the 1970s (see Purdy et 
al. [523]). Now the K-Bessel function is available in most computer packages such 
as Mathematica. However, those who do the massive computation of Maass forms 
still have to worry. Booker et al. [48] say “the most time-consuming task is, by far, 
that of computing the values of the K-Bessel function.” 

Hejhal appends this comment of C. L. Siegel to his table: 
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Ausserdem ist in Erwägung zu ziehen, dass die oben angegebenen numerischen Werte von 
v(x) mit Hilfe von Rechenmaschinen bestimmt wurden und daher ebenfalls im strengen 
Sinne unbewiesen sind.” —From Siegel [600, Vol. II, p. 439]).8 


In producing Table 3.10 of Fourier coefficients for the first even nonholomorphic 
cusp form, Stark [631] used the fact that the cusp form is an eigenfunction for the 
Hecke operators to be considered in the next section. Thus we have the equation 


cpf) =p? > f (=) +p? f(pz) 


jmodp 


(see Theorem 3.6.4 of Sect. 3.6). Stark then fixes a point such as z = 1.4i and looks 
at the Hecke points 


zy = (zt j)/p, f=... (3.88) 


One can find a matrix A; € SL(2,Z) such that w; = A jz; lies in fundamental domain 
for SL(2,Z). So our equation becomes 


cpf) =p? E fwi) +p? flee): (3.89) 


One then uses Eq. (3.89) to obtain the coefficients cp recursively with more and more 
accuracy. You calculate f(z) and the other values of f using the Fourier expansion 
of f, with a few Fourier coefficients; e.g., c1 = 1. Then you get some digits of c2. 
You plug those in and get more digits. This assumes that you know the eigenvalue, 
but you can also use the process to approximate eigenvalues by checking that the 
function f approximated is invariant under the substitution z > — 1 /z. 

Figure 3.39 shows a plot of the points w; used by Stark in formula (3.89) above 
for p = 983 after they have been moved (using the map z > —1/z) from the standard 
fundamental domain with its cusp at co to the fundamental domain with a cusp at 0. 
One can then ask whether these points wj, jmod p, become uniformly distributed 
(with respect to the non-Euclidean area) in the fundamental domain as p approaches 
infinity. This would account for the fact that the sum (3.89) of p terms times p? 
seems to be giving a result that is smaller by a factor of p`? than one would expect. 
In fact, the Ramanujan—Petersson conjecture for Maass waveforms would follow 
from such uniform distribution of the points w; in the fundamental domain. Indeed, 
Stark has suggested that perhaps one could use the Ramanujan—Petersson conjecture 
for holomorphic modular forms to show that the desired cancellation by p72 does 


occur. See Chiu [86] where it is proved that the Hecke points become dense in the 
fundamental domains of SL(2,Z) and SL(3,Z) as p — œ. See also Exercise 3.6.17. 


8One also has to keep in mind that the numerical values of v(x) were determined with the help of 
a calculator and have consequently not been proved in a strict sense. 
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Table 3.10 Stark’s table of Fourier coefficients for the first even cusp 


form 


p 


2 


157 
163 
167 
173 
179 
181 
191 


193 


c(p) 
1.5493044779 
0.2468997725 
0.7370603853 
—0.2614200758 
—0.9535646526 
0.2788270292 
1.3073417145 
0.0925585825 
1.1380685214 
0.7521138455 
0.0248519535 
0.1992656556 
—0.3040329968 
0.7832393635 
0.3605684105 
1.3980657196 
—1.5877309619 
1.1672759688 
—0.0270938863 
—1.6334238582 
1.0678477369 
—0.5311738889 
—0.9045323799 
—1.0673531716 
—0.0032571155 
0.8641852969 
—1.2318448417 
—0.8126520455 
—0.1537416810 
0.8117725443 
1.1696610310 
—0.6111258748 
0.7718026731 
0.0953561766 
—0.1869675229 
—0.2836096280 
1.1464542083 
1.5036129830 
—0.7243411282 
0.0377358321 
—0.5924790734 
1.8928188557 
0.4549235996 


—0.3510653631 


307 


349 


443 


449 


457 


463 


523 


541 


c(p) 
—1.4815318734 
0.5040655357 
0.0799 184237 
1.2661555488 
—1.6137265420 
—0.9962645200 
1.1244326539 
—0.0039967298 
0.4789645 104 
—1.5835570331 
1.1152062270 
0.439003 1691 
—1.4315675896 
—0.3501353058 
1.7625605824 
0.5325184967 
—0.0033416821 
—0.9876462546 
1.5933050198 
0.0502016847 
—0.3273589916 
—1.8931512743 
—0.8126034389 
1.4753959508 
0.7396796406 
0.87925 14066 
—1.2083667833 
—0.1571375484 
—0.6020990403 
1.0132298238 
0.4461683165 
—0.7081760488 
0.3300809302 
1.2726492305 
0.1481146844 
1.1482791938 
—1.1381929615 
—1.4887418682 
0.4359170070 
0.3463320737 
0.7903047596 
—0.8278509888 
—1.0508097534 


0.6476703225 


c(p) 
—0.9413471724 
—1.2022515538 
1.2204182342 
—1.6267782096 
—1.2717803924 
1.3158148270 
—0.7308513965 
—1.2242624630 
—0.0234437218 
1.3778855357 
0.0265995820 
0.5292669101 
—1.3160485151 
1.5888219871 
—0.5251706918 
—1.0362134271 
—0.7502921970 
—0.9208587111 
0.8958405935 
1.6323865759 
—0.9959260442 
1.3259008900 
—0.5856192114 
—0.1033879313 
—1.4675416297 
—0.4231228583 
0.1724453075 
—0.3987763947 
—0.9581311790 
—0.5238603585 
0.1600228277 
—0.1779829984 
0.4892801545 
1.5943055357 
—0.0944388002 
1.2009724917 
—0.9219874648 
1.1751819412 
—0.4805639483 
—1.1622088814 
1.4610158527 
—1.9462665557 
—0.1627440731 
0.5225227593 


(continued) 
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Table 3.10 (continued) 


197 —0.4934900086 547 0.0132913465 919 —1.0727170339 
199 0.7615593277 557 —0.8512529528 929 —0.1305562742 
211 1.6334955452 563 0.7157534392 937 —1.0585435062 
223 —0.8898352549 569 —0.6677981282 941 —0.7542322227 
227 —1.1170228371 571 —0.2379643425 947 1.3539019798 
229 —1.1231294363 577 0.9500705634 953 0.4262552906 
233 1.1318288732 587 1.0559415688 967 0.7553483561 
239 —0.6112502497 593 0.1197484790 971 —0.0605613598 
241 1.6168818500 599 0.8007081710 977 —0.3377517502 
251 —0.4397565620 601 1.2754657449 983 —1.7315964075 
257 0.3781399469 607 —1.5299526484 991 1.6842084188 
263 1.0274020100 613 0.6464422791 997 —1.0060566454 


From Stark [631]. Reproduced with permission from Ellis Horwood, 
Chichester, England. Normalized eigenvalue ~13.7797513519. 


Figure 3.40 is a representative density plot of a Maass cusp form from an older 
version of Dennis Hejhal’s website: 


http://www.math.umn.edu/~hejhal/. 


while Fig. 3.41 shows some density plots of Maass wave forms from Sarnak [560]. 

Physicists have asked whether distributions of eigenfunctions of A would 
“become localized” as the eigenvalue approaches infinity. By localizing, we mean 
that they tend to vanish on some open set in the fundamental domain. This idea 
came from looking at various sorts of billiard tables in the plane. If instead we look 
at density plots of Maass waveforms such as those in Figs. 3.40 or 3.41 from Sarnak, 
we will tend to think that the distributions do not localize. Well, I have to admit that I 
sometimes see monsters in these pictures and sometimes approximations to parts of 
horizontal lines or circles. But the horocycles and geodesics can bounce around quite 
madly. Thus even if the support of the function were concentrated on horocycles or 
geodesics, I still would not see an open set where the function vanishes in the limit 
of large eigenvalues. 

To get a clearer idea what is happening, let’s just take our discussion from Sarnak 
[560]. Suppose M is a Riemannian manifold with finite volume, negative curvature, 
and without boundary. Denote the Laplacian of M by A and the corresponding 
volume element by du normalized to have u(M) = 1. Suppose Ag + Ag = 0 
on M, 9 € L (Q). In quantum mechanics dvọ = |p|" du is interpreted as the 
probability distribution associated with being in the state @. Do these measures 
become equidistributed as A — œ or can they localize? Sarnak notes: “If M has 
(strictly) negative curvature, then the geodesic flow is very well understood thanks to 
works of Hopf, Morse, Sinai, Bowen, and others. It is ergodic and strongly chaotic 
in all senses. The periodic geodesics are isolated and are unstable, and there is no 
restriction on how they may distribute themselves as their period increases. ...” See 
the last section for more information on geodesics in the fundamental domain. 
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In his work with Rudnick [548] and others, Sarnak was led to the Quantum 
Unique Ergodicity conjecture (or QUE conjecture): 


Vo >H as À — œ. 


Sarnak [560] says: “Put another way, u is the only quantum limit.” In [325, 422, 
548,612] it is proved that the QUE conjecture holds for arithmetic manifolds such as 
SL(2, Z)\H both for continuous and discrete series Maass waveforms. An analogous 
result for holomorphic modular forms (which are Hecke eigenfunctions) has been 
obtained by Holowinsky and Soundararajan [299] as the weight goes to infinity. 


Some History 

In the spring of 1977, Riho Terras (referred to as mon ex-mari in Cartier and 
Hejhal [78]) and I spent a personally disastrous but academically exciting sabbatical 
at the University of Bonn. During that time we visited Paris and R.T. spoke to Cartier 
about computing K-Bessel functions to improve the eigenvalue programs. We were 
lucky enough to witness a general strike and R.T.’s talk had to be cancelled. 


-0.5 0.5 


Fig. 3.39 Images of Hecke points from formula (3.88) in a fundamental domain for the modular 
group obtained by applying the map z + —1/z to the standard fundamental domain. The points 
were used by Stark in the computation of Table 3.10. Here we take p = 983 and z = 2i 
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Fig. 3.40 A density plot of a Maass cusp form from an older version of Dennis Hejhal’s website 
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Fig. 3.41 Density plots of some Maass wave forms from Sarnak [560] 


Later that spring I visited H. Maass in Heidelberg and met H. Neuenhdéffer, who 
showed me the calculations of Haas. No one mentioned ¢ (s) and my thoughts were 
only on the eigenvalue problem (and my own troubles). Then Neuenh6ffer attempted 
to send me the Haas manuscript, but only the outside envelope arrived. Was someone 
in the Post Office interested in the Riemann hypothesis? Unknown! But finally a 
letter from Neuenh6ffer reached me which included the Haas tables. 

I put some of these tables in the first version of this book, on which I lectured 
at M.LT. in the Fall of 1978. Then H. Stark noticed the zeros of ¢ (s) in the Haas 
table. But he overlooked the L-function zeros somehow. By that time probably 100 
people had a preliminary version of this book with the Haas table in it. But very few 
people looked at it, perhaps. Anyway, ultimately D. Hejhal visited U.C.S.D. and I 
showed him the table. He immediately said that it gave him a headache. Evidently 
he went to the library and found the L-function zeros also, but he did not tell me that. 
Instead he wrote a very cryptic remark on my blackboard, which he does not want 
printed here. But I think that he did not believe the Haas table even then. He worked 
hard for several months, and the result was Hejhal [263], which was announced in 
the summer of 1979 at the Durham conference. 

In the preliminary version of this book I had attempted to use the Courant 
minimax principle to compare eigenvalue problems such as those represented by 
Tables 3.7 and 3.8 (see Courant and Hilbert [111, p. 409]). However, the existence 
of a continuous spectrum makes these arguments go awry. 

One might still wonder whether there is any chance to prove the Riemann 
hypothesis by interpreting the zeros s = 5 + it of C(s) as giving eigenvalues 
s(s —1) of some self-adjoint operator. Pólya and Hilbert evidently suggested 
this independently around 1915, though they were probably not thinking of the 
eigenvalue problem discussed here. Hejhal’s work appears to have laid to rest the 
possibility that A is the correct operator unless one can somehow deal with spurious 
eigenfunctions with logarithmic singularities. The fact that Epstein zeta functions do 
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not satisfy the Riemann hypothesis in most cases leads one to become dubious about 
; ; : 1 0 
this approach. In fact, the Epstein zeta function for the matrix e 2) has an 


infinite number of zeros in the region Re s > 1 (see Titchmarsh [680, p. 244]). 
See Venkov [695, p. 159] for a published version of a conjectural meaning for the 
eigenvalues associated to Maass cusp forms and their analogues. More recently there 
have been adelic versions of the Pélya—Hilbert conjecture by Paul Cohen and Alain 
Connes [102-104]. 


Exercise 3.5.14. Try to check Table 3.5. 


Note. Hejhal says the preceding Exercise is too hard. A class of beginning physics 
and engineering graduate students agreed when they tried the standard finite- 
element method on the problem. You need good Kj; programs if you use the Fourier 
expansion method that Hejhal used. It would also be interesting to program Stark’s 
method using Hecke operators from Sect. 3.6 to produce Table 3.10. 

Maass [439] shows that there are cusp forms for congruence subgroups of 
SL(2,Z) [but not for SL(2,Z) itself] which arise from Hecke L-functions of real 
quadratic number fields. See also Hejhal and Str6mbergsson [275] who call them 
CM Maass forms, where CM stands for complex multiplication, referring to a 
phenomenon that happens for elliptic curves. Marie-France Vignéras [699] derives 
this in another way by integrating a certain theta function. We will return to this 
topic in the next section. It does not appear that anyone has been able to use such 
constructions to obtain cusp forms for SL(2, Z), however. 

Hejhal [273] computes Maass wave forms for Hecke triangle groups and finds 
no even cusp forms for the nonarithmetic Hecke triangle groups. 


3.5.4 Elementary Estimates of Fourier Coefficients 
and Eigenvalues 


We have mentioned the Ramanujan—Petersson conjecture for Maass cusp forms 
(which are eigenfunctions for the Hecke operators of the next section). This 
conjecture was stated in formula (3.87). Let us now prove some easier estimates on 
Fourier coefficients. 


Theorem 3.5.2 (Estimates for Fourier Coefficients of Maass Waveforms). 
(a) Let E} (z2) =m *T(s)¢ (2s)Es(z) and 


Ex (z) = ay’ + by!’ + £ anyYK, 1 (22|n|y) exp(27inx). 
n#0 


Then, for every € > 0 there is a positive constant Ce such that 


lan] < Ce|n|Res- 2+, 
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(b) Suppose that the Maass cusp form f € SN (SL(2,Z),s(s—1)) has the Fourier 
expansion 


fC) = È anVVK,_ 1 (21|nly) exp(2minx). 
n#0 


Then there is a positive constant C such that 
1 
|an| < C|n|?. 


Proof. 


(a) We know from Exercise 3.5.4 that, if Res > 1, then 
lan| = Hn Jo1-26(n)| < n2 €(2 Re s— 1). 


However, if 4 < Res < 1, then |a| < 2\n|Res— 2 op (n). Then Hardy and Wright 
tell us that given € > 0, there is a positive constant Ce such that co(n) < Cen? 
(see Hardy and Wright [251, pp. 260-262]). 

(b) The proof proceeds as in Exercise 3.4.18 of Sect. 3.4. Use 


1 
any! PK, Only) = | f(+iy)exp(-2rins)dx 
and the fact that cusp forms are bounded on the upper half-plane. Then one sets 


y =c/|n|, where c is chosen so that Kı (27c) + 0, to complete the proof. 


Exercise 3.5.15. Fill in the details in the proof of Theorem 3.5.2. 


Note that in Theorem 3.5.2, when Res = 5 one has a better estimate for the 
Fourier coefficients of Eisenstein series than one has for cusp forms. This is rather 
strange if we remember the holomorphic case. There are better estimates than that 
given in Theorem 3.5.2 when the cusp form is an eigenfunction for all the Hecke 
operators. Then the Ramanujan—Petersson conjecture (3.87) above would say that 
the same sort of estimate would then hold. See the earlier discussion of Eqs. (3.86) 
and (3.87) for some references on Fourier coefficients of cusp forms. In addition, 
one can consult Bruggeman [68-70], and Kuznetsov [381]. 


Theorem 3.5.3. Suppose SN (SL(2,Z),A) 4 {0}, then à < —3n7/2. 
Proof (Roelcke [545, Sect. 7]). Suppose that 


u(z) = X, an(y)exp(2minx) € SN (SL(2,Z),A). 
n#0 
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Let D denote the usual fundamental domain for SL(2, Z), given in Fig. 3.14. And let 
D* = y(D), where y(z) = —1/z. Then if (u,u) = 1, as in Exercise 3.5.9, part (a), we 


have 
2 
A] =—2(Au,u) = | ( a *) eva 
pup* \ | dx 


> f |uParay= [San lan(y)Pay 


Ou 
dy 


Ix|<4 y2 v32"? 
y= 3/2 
> 372 ¥ |an( )\|’ dy = 32 J / | |"—dx dy 
aap y>v3/2 |x|<4 


1 
> 3r? | |u|? dx dy = 37°. 
D y 


This completes the proof. 


Note that Hejhal’s Table 3.5 gives a much better estimate for the first eigenvalue 
corresponding to a cusp form. However, Theorem 3.5.3 is of interest, because the 
argument can be applied to a few congruence subgroups of small level. 

Let |A (T)| denote the smallest nonzero eigenvalue of the Laplacian in absolute 
value on I'\H such that the corresponding eigenfunction is orthogonal to the 
constants. You might wonder whether there are discrete groups T in SL(2,R) such 
that 0 < |A,(T)| < i Selberg [570] gives examples of such groups I’. Randol [530] 
proves that such groups T exist with T'\H compact and having as many eigenvalues 
A in (—a,0) as you wish, for any given a. Moreover, Selberg [570, pp. 13-14] 
conjectured that if I is a congruence subgroup, then |A,(T)| > i This is now 
called Selberg’s eigenvalue conjecture. Selberg actually proved that |4; (T)| > 
3/16 = 0.1875. Sarnak [556] says “I think this is the fundamental unsolved analytic 
question in modular forms. It has many applications to classical number theory .... 
If true, it is sharp. In the first place, the continuous spectrum ... begins at 1/4.” See 
also Luo et al. [436] who show that |A; (T)| > 171/784 = 0.2181. 

Elstrodt [156], Hejhal [261, Vol. II], and Marie-France Vignéras [703] give more 
history of this problem and many more references. In particular, Vignéras notes that 
this conjecture and that of Ramanujan and Petersson given in Eq. (3.87) above are 
“deux volets d’une même conjecture” ° in representation theory (see also Satake’s 
article in Borel and Mostow [54, pp. 261—262]). Piatetskii-Shapiro [516] notes that 
the statement in adelic representation theoretic language can fail, but not for GL(n), 
using results of Jacquet and Shalika. 


°Two parts of the same conjecture. 
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3.5.5 Dimensions of Spaces of Maass Cusp Forms 


There is a conjecture that dimc SN (SL(2,Z),A) < 1 (see Cartier [77] as well as 
Booker et al. [48, p. 24]). All the eigenvalue tables created so far give evidence 
in favor of this suspicion. See Randol [531] for a proof that for sufficiently large 
eigenvalues, the dimensions of spaces of cusp forms for congruence groups T (p), p 
an odd prime, must be greater than 1, using knowledge of the degrees of irreducible 
representations of the finite simple group PSL(2,Z/pZ). Here we prove only that 
the SN (SL(2,Z),A) are finite-dimensional vector spaces. 


Theorem 3.5.4. The vector space of Maass cusp forms SN (SL(2,Z),A) is finite 
dimensional. 


Proof (Maass [439, pp. 154-156], using an idea of Siegel [600, Vol. II, pp. 97-137 
]). Let the Maass cusp form f be in SN (SL(2,Z),s(s—1)). Suppose that the Fourier 
coefficients in Exercise 3.5.3 are all zero, up to the coefficients a, with |n| = m. 
Then we can show that if m is sufficiently large (depending on t, when s = 5 +it), 
it forces f to be identically zero. We know that f is bounded on the upper half- 
plane by Exercise 3.5.10. Thus |f(z)| has a maximum at some point zo € H. Set 
zo = xo + iyo and note that 


1 
2 


a i 2Ky(2 
any} Ku (2m |n|yo) = I erin p (x Bt) a(2 1 ( site) 


| 2 Ka (nly) 
ky 


x=— 


It follows that 


M =|f(zo)| <2V2M J, Kiı(27|n|yo)/Ki (7 |n| yo). 


|n|>m 


The asymptotic formula for the K-Bessel function (see Exercise 3.2.2 of Sect. 3.2 
or Lebedev [401, p. 123]) shows that if m is sufficiently large (depending on ft and 
yo), the preceding series of quotients of K-Bessel functions can be estimated by 
exponentials. The result is that 


M < CMexp(—zmyo). 


If mis larger than (logC)/zyo, it follows that M is zero and, thus, that f is identically 
zero, as was to be proved. 


Exercise 3.5.16. Use the error term in the asymptotic expansion for Kj; (y) to obtain 
a bound for the dimension of the space SN (SL(2,Z),—(1? + })). This bound will 
be an increasing function of t. 


We can obtain an asymptotic formula for the sums of dimensions of spaces of 
Maass cusp forms with eigenvalues less than or equal to x, as x approaches infinity, 
once we have proved the Selberg trace formula (see Sect. 3.7). The trace formula 
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implies that if N(x) denotes the number of eigenvalues A, of A corresponding to 
cusp forms, counted with multiplicity, such that |A,| < x, then 


Mo Ceuma T oas xo, (3.90) 


The asymptotic formula (3.90) is the non-Euclidean analogue of the Weyl asymp- 
totic law for the distribution of eigenvalues of the Euclidean Dirichlet problem in 
a compact domain in R” (see Theorem 1.3.5 of Sect. 1.3 and Theorem 3.7.5 of 
Sect. 3.7). 

In particular, formula (3.90) implies that there are infinitely many Maass cusp 
forms for SL(2, Z). This application of the trace formula was pointed out by Selberg 
[573]. The argument only works for discrete subgroups T of SL(2, Z) such that T\H 
is noncompact of finite volume, with the property that there are good bounds on the 
constant term coefficients in the Fourier expansion of the Eisenstein series on the 
line Res = r Roelcke [545] gives more examples of groups I with infinitely many 
cusp forms. So there arose a conjecture, called the Roelcke-Selberg conjecture, 
to the effect that there are infinitely many cusp forms for very general discrete 
groups. Venkov has proved the infinite dimensionality of spaces of cusp forms 
with more general transformation properties for a wide class of discrete groups (see 
Venkov [693,695]). But Phillips and Sarnak [514] cast doubts on the conjecture for 
nonarithmetic groups. Hejhal’s computations [273] for Hecke triangle groups seem 
to add some evidence in that Hejhal found no even cusp forms for nonarithmetic 
Hecke triangle groups. There are always odd cusp forms for these groups by 
analogous arguments to those in Exercise 3.7.6. 

It is natural to ask whether there are connections between nonanalytic and 
analytic cusp forms. We will find that there are even more analogies in the next 
section. There are certainly various constructions that allow one to go from forms 
of one type to those of another type. Often this is done using a theta function for 
indefinite quadratic forms as a kernel and integrating (see Kudla [377,378], and 
Marie-France Vignéras [699]). Theta functions of indefinite quadratic forms and 
related matters are also discussed in Siegel [600, Vol. III, in papers nos. 55, 58, 
60]. A differential operator that maps nonholomorphic automorphic forms satisfying 
higher-order differential equations into holomorphic forms (in certain cases) is given 
by Schwandt [564]. See also Roelcke [544, Part I, Sect. 6] and Neuenhoffer [494]. 
Neuenh6ffer gives a construction of nonholomorphic cusp forms via residues of 
Poincaré series. 

Jacquet and Langlands gave an adelic version of a 1-1 correspondence between 
cusp forms for quaternion groups and cusp forms for congruence subgroups. Hejhal 
[272] gave a classical version of this correspondence for cuspidal Maass waveforms 
by integrating against a theta function for indefinite quadratic forms. See also 
Strombergsson [645]. 
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3.5.6 Finite Analogues of Eisenstein Series 


We have discussed finite upper half-planes H, over the finite field F}. Throughout 
this subsection we assume that q is odd, as we did in the Sect. 3.3.8 on finite 
upper half-planes. There we considered fundamental domains and tessellations of 
H; for subgroups T of GL(2, F4). Thus it is natural to consider analogues of modular 
forms for such subgroups T of GL(2,F,). Here we look only at complex-valued 
modular forms. Of course, you could also think about finite field-valued modular 
forms. Perhaps it can be argued that the most natural analogue of the modular 
group SL(2,Z) acting on the Poincaré upper half-plane is GL(2,F ,) acting on Hy, 
assuming q = p”, with r > 1. One can also consider more general subgroups as 
do Shaheen [583,584], and Hamahata [245]. Shaheen [583] shows that we have 
analogues of some of the modular functions in Sect. 3.4 and the present section as 
well as analogues of the special functions of Sect. 3.2, such as the power function, 
the K- and k-Bessel functions, and the gamma function. See also Shaheen and Terras 
[585]. 

First we need a multiplicative character x of F}. Suppose that g is a generator of 
the multiplicative group F}. The multiplicative character 7 = %a has the following 
form for integers a, b: 


2niab 


Xall) =e, O<a,b<q-2. (3.91) 
Then we define the power function associated to y by 


pz (z) = x (m2). 


As with the power function on the Poincaré upper half-plane H, py(z) is an 
eigenfunction of all the adjacency operators A, for finite upper half-plane graphs in 
formula (3.53). 

Let Y be an additive character of F}. Then for some b € F}, we have Y = Y, 
where 


2niTr(bu) 
Y,(u) =e P  , for b,u €F}. (3.92) 
Here if q = p”, the trace in the exponent is 


Tr(u) = Trg jp (U) =ut uP tu? +u. (3.93) 


Let Y be an additive character and y a multiplicative character of F4. Define the 
Gauss sum by 


r(x, ¥) =T) => LOYE). (3.94) 


“ 
te; 


The Gauss sum is analogous to the gamma function of formula (1.27). 
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See Ireland and Rosen [317] for more information on Gauss sums. In particular, 
they show that, if y and ‘VY; are not trivial, then 


rE Yb) = xb, Y1). (3.95) 


Another basic result says that if y and ¥ are not trivial, then 


[ra P)| = va. (3.96) 


The Kloosterman sum for a,b € F% is defined by 


(xla,b)= ¥ x(t) (at+br'). (3.97) 


rey 


The Kloosterman sum is analogous to the K-Bessel function of formula (3.17). 
These sums have great importance in number theory. See Sarnak [557] who 
discusses the connection with modular forms and “Kloostermania.” 

For a multiplicative character y of F% and an additive character ‘P of F4, if z € Hg, 
define the k-Bessel function by 


tenon) Bx (( aa) PO) 


This is an analogue of the k-Bessel function in formula (3.16) and, by defini- 
tion, k(z) = k(z|%,¥) is an eigenfunction of all the adjacency operators A, in 
formula (3.53) if k(z) does not vanish identically on Hy. 

Nancy Celniker et al. [82] find a finite analogue of formula (3.18) relating k- 
and K-Bessel functions. First we must recall the definition of the quadratic residue 
symbol £; by 


1, if y is a square in F*; 
€q(y) = ¢ —1, if y is not a square in F*; (3.98) 
0,y=0. 


We saw this symbol before in formula (1.33). 

For a nontrivial multiplicative character y of F% and a nontrivial additive 
character Y, of F4 and z € H4, we have the relation between k-Bessel functions 
on H, and Kloosterman sums: 


1 
Talat. Yok Ya) = gax 0)Pal =K (zey|-8y7,-7), 69 
where ga = 5 cF Pu?) is a Gauss sum and the Kloosterman sum K is defined by 
4ucr@ 


formula (3.97). 
Why do we call g4 a Gauss sum? It is an exercise to show that for a 4 0 
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ga= >, Yal) = ¥ (1+ ezlu)) Palu) = eg (a`!) Tg (€q,%1). (3-100) 


uck, ucFg 


Now we define an Eisenstein series that is analogous to the Maass Eisenstein 
series defined in formula (3.81). Let I be a subgroup of GL(2,F,) and let x be 
a multiplicative character on Fy Define the finite Maass Eisenstein “series” for 
z € Hy as 


E; r(z) = J, x@m(72)). (3.101) 
yer 


Note that we do not sum over I’..\I’, where T. is the subgroup of T consisting of 
: ab se ct : ae ; 
matrices (G ) €T. We do not need to do this, since I... is finite in this case and 
a 


thus just contributes a factor of |I]. 

By definition, Eyr is an eigenfunction of the adjacency operators in for- 
mula (3.53) for finite upper half-plane graphs as long as it does not vanish identically 
on H,. We found in Shaheen and Terras [585] that for q = p”, with q > 2 andr > 2, 
Ey GL(2,F,) 18 not identically zero if and only if y = Yq when a is a multiple of 
(p— 1). This last condition is equivalent to saying that XalEs = 1. For f,g:H, >C, 
define the inner product 


(f.8)= È FO). 


zEH4 


Computing (Eya: Exp ) allows one to show that the Eisenstein series are also not 
identically zero when r = 2 and % = Yq when a is a multiple of (p — 1). It also 
allows one to find out when these Eisenstein series are orthogonal. We leave this as 
an exercise for the reader. 

One the main results of Shaheen and Terras [585] is a finite analogue of 
Exercise 3.5.4 and says that the Fourier expansion of the Eisenstein series 


Ey c1(2,F,) (2) is given by 


aE. = PU g(€q,¥i)Vg(€qX, Y1) i 
pp irer) T(x, 1) X ( Oy) Ez ( ô) 
(3.102) 
pUq(&q,¥1) b b)K ( | ô 2 1) 
tra >. x(b)ea(b)Ky, | qx a »(—2). 


Tr(b)=0 


Shaheen and Terras also prove the analogous result for T = SL(2,F,). The key 
ingredient in the proof is Poisson summation for the subgroup F, of F4. 

Many questions remain on this topic. Are there finite analogues of cusp forms? 
Can we obtain them by averaging over I the spherical functions associated to 
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discrete series representations of G? That is take the Soto—Andrade spherical 
function of formula (3.56) and average it over I. We leave such questions to the 
reader. 

The paper [245] of Hamahata looks at finite analogues of Hilbert modular forms 
(to be discussed in Volume II, i.e., [667]) for a group I C GL(2,F 2), where y € 
GL(2,F 2) acts on (z,z') € Hp x Hp by ¥(z,z') = (yz, 72’). Here y is the matrix 
obtained from y by taking conjugates of all entries, where the conjugate on elements 


of Fg is defined as follows if F 2 =F, (v3), where 6 is a nonsquare in Fp: 


(x+yv5) =x-yv5, for x,y € Fp. 


3.6 Modular Forms and Dirichlet Series. Hecke Theory 
and Generalizations 


... Hecke took up the subject of modular functions and put it back into number theory where 
it had always belonged. ... 


—From Weil [729, Vol. IM, p. 301]. 


3.6.1 Dirichlet Series Corresponding to Holomorphic Modular 
Forms: The Hecke Correspondence 


We saw in the proof of Theorem 1.4.1 of Sect. 1.4 that the Mellin transform allows 
one to deduce the basic properties of the Epstein zeta function from those of the 
theta function. This is only a special case of a very general theory for which the 
boundaries have not yet been seen. 

In the 1930s, Hecke looked at the following situation (see Hecke [258, pp. 591- 
626, 627-643, 644-671, 672-707, etc.] and [259]). Suppose that f is a classical 
holomorphic modular form (see Sect. 3.4); i.e., f E€ M(SL(2,Z),k) with Fourier 
expansion 


f(z) = X, anexp(2minz), (3.103) 


n>0 


and consider the Mellin transform 
Mls) = [ FO) - a0) "ay (3.104) 


Then from Euler’s formula for the gamma function, we know that for Re x > 0 and 
Res > 0, 
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x “T(s j= f y5 —1 e “dy 
(see Lebedev [401, Chap. 1] for a discussion of I’(s)). It follows that 


Ap(s) = (2%) *T(s)Ly(s) with Lels) = ¥ ayn. (3.105) 


n>1 


We define L f(s) to be the L-series corresponding to the modular form f(z). The 
L-series in Eq. (3.105) converges for Re s > k, by Theorem 3.4.4 of Sect. 3.4. 


Exercise 3.6.1 (Analytic Continuation and Functional Equation of L-Series 
Corresponding to Holomorphic Modular Forms). 


(a) Show that f € M(SL(2,Z),k) implies that A p(s) defined by Eq. (3.104) can be 
continued to a meromorphic function of s € C so that 


Ay(s)+ f + 7 
S a = 
f Vs k-s 
is entire and bounded in vertical strips (EBV for short). 
(b) Show that, moreover, A;(s) satisfies the functional equation 


Af(s) =i Ag(k—S). 


Hint. Imitate the proof of Theorem 1.4.1 of Sect. 1.4. That is, split up the Mellin 
transform representing A(s) into 
1 oo 
p 
0 1 


Then use the transformation formula f(z) =z~* f(—1/z) to rewrite the integral over 
(0, 1]. The result is that, as before, the L-series can be evaluated by an incomplete 
gamma expansion: 


ik 
no) =a (- L+ + Yan (G( (s,2mn) + G(k —s,2nn)i*), (3.106) 
n>1 


with the incomplete gamma function 
G(s,x) =l tl exp(—xt)dt for Re x> 0, 
1 


as in Theorem 1.4.1 of Sect. 1.4. The exercise follows from this expansion, using 
the exponential decay of G(s,x), as x approaches infinity (see Exercise 1.4.7 of 
Sect. 1.4). 


Hecke noticed that Mellin inversion (see Sect. 1.4) allows the argument of 
Exercise 3.6.1 to be turned around. 
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Exercise 3.6.2 (A Converse Result in Hecke Theory). Suppose that f(z), with 

Fourier expansion in formula (3.103), is given and consider the function A f(s) 

defined by Eqs. (3.104) and (3.105). Suppose that A f(s) has the functional equation 
Af(s) = Ag(k—S). 


and that 


Af(s) + TE 
Pere s k-s 


is entire and bounded in vertical strips (EBV). Prove that then f(z) is a holomorphic 
modular form of weight k for SL(2, Z); i.e., that f lies in M(SL(2,Z),k). 


Hint. Note that z> z+ 1 and z > —1/z generate SL(2,Z)/ {+I} = PSL(2,Z). 
Thus, it suffices for us to show that f(z) = z-*f(—1/z). Use Mellin inversion to 
write 


1 i 
i — ay = =~ | =A p(s)ds, 
flr) a= zz f _ A 
where c is sufficiently large for the absolute convergence of Ap. Push the line of 
integration to the left and pick up residues at s = k and s = 0, to obtain 


1 
2ri 


f(iy)— ao = L y *Af(s)ds + iky ‘a — ao. 

Res=k—c 
Then use Af(k—s) = Ap(sé, to see that f(iy) = (i/y)* f (i/y) for all y > 0. To 
complete the exercise, note that if two holomorphic functions on H agree on a set 
with an accumulation point, they must agree on all of H. 


Some references for these exercises are Hecke [258,259], Lang [391], Ogg [505], 
and Shimura [589]. Note that more general groups than SL(2, Z) can also be treated 
in this way. The following theorem follows from the exercises. 


Theorem 3.6.1 (Hecke’s Correspondence for Holomorphic Modular Forms for 
SL(2, Z)). Suppose that f(z) has the Fourier expansion given in formula (3.103). 
Suppose that the corresponding Dirichlet series is given by formulas (3.104) 
and (3.105). Then we have the equivalence 

f € M(SL(2,Z),k) 


1 jk 
= Ay(s)+a0 (2+) is EBV with A,(s) = ÃAp(k— s). 
PEDT ! 


Here EBV means entire and bounded in vertical strips. 
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Example 3.6.1. The Dirichlet series corresponding to the classical Eisenstein series 
G, from Sect. 3.4 is 


X, O-1(n)n = C(s)E(s+1—k). (3.107) 


n>1 


The functional equation can actually be deduced from that of the Riemann zeta 
function, using the duplication formula for the gamma function and the functional 
equation of the gamma function. 


Exercise 3.6.3. Prove the assertions made in the preceding example. 


3.6.2 Dirichlet Series Corresponding to Maass Waveforms 


From our point of view, the next logical question was answered in the 1940s by 
Maass [439]. This question is: What is the analogue of Theorem 3.6.1 for Maass 
waveforms? Another reference for the answer is Maass [437]. 

In Hecke’s case we needed to know the Mellin transform 


| ey dy = T(s) for Res>0. (3.108) 
0 


In Maass’s case we need to know the Mellin transform 


[Koy ars ar (7) (5), if Res>|Rer|. (3.109) 
0 


Exercise 3.6.4. Prove formula (3.109). This was part (b) of Exercise 3.2.7 in 
Sect. 3.2. 


Hint. Use the integral formula in Exercise 3.2.1 of Sect.3.2 for the K-Bessel 
function to see that 


1 . 1 
RU p= >| f ey [=r +) dt dy. 
y>0 2 Jy>0Jt>0 2 


Then make the substitution u = yt, v = y/t, to complete the proof. 


Let f be a Maass waveform; i.e., f E€ N(SL(2,Z),r(r— 1)), with Fourier 
expansion as given in Exercise 3.5.3 of Sect. 3.5: 


f(2) =ay" thy! + ¥ any?K, (22|n|y)exp(27inx). (3.110) 
n#0 
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Consider the Mellin transform: 


Wy(s) = | (fli) —ay’ — by!) dy. (3.111) 


om, 8 


Then, Exercise 3.6.4 shows that 


W/(s) =252q- OHDOT (= — =) r (=) L;(s), (3.112) 
with 
Lp(s) = X an|n|-O*. (3.113) 
n#0 


We define L,(s) to be the L-series corresponding to the Maass waveform f € 
N(SL(2,Z),r(r—1)). This series converges by Theorem 3.5.2 of Sect. 3.5 when 
Res > |Rer|, if f is not a cusp form, and for Res > 1, if f is a cusp form. 


Exercise 3.6.5 (Analytic Continuation and Functional Equation of L-Series 
Corresponding to Maass Waveforms). Define the higher-dimensional incomplete 
gamma functions G(s1,523a) by 


G(s1,52;a) = I uly eTa) ay dv, for Res;>0 and Rea > 0. 


uv>1 


(a) Use the same trick as in Exercise 3.6.1 to show that W; (s) in Eq. (3.112) has the 
following incomplete gamma expansion, assuming f € N (SL(2,Z),r(r— 1)): 


Ws) 1 4. 1 b 1 p 1 
s) = H 
f a S—r —sS-r s+r—1 -s+r—-1 


str s—r+1 —str l—s-r 
+Z ado ( an) (5an). 


n0 


(b) Show that the incomplete gamma function G(s,,52;a) dies off exponentially as 
a approaches infinity. 
(c) Deduce that W,(s) continues to a meromorphic function of s such that 


w;(s) 1 i 1 b 1 : 1 
Je S-r —s-r s+r—1 —s+r-l1 


is EBV, with functional equation Wọ (s) = Wp (—s). 


Hint. To prove (a) use the same substitution that appeared in Exercise 3.6.4. 
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Now we want to argue that Mellin inversion implies a converse to Exercise 3.6.5, 
analogous to Hecke’s converse result in Theorem 3.6.1. However, in Exercise 3.6.2, 
we use a fact about holomorphic functions to show that f(iy) = (i/y)*f(i/y) for all 
y > O implies that f(z) = z~*f(—1/z) for all z in H. But real analytic functions f(z) 
and g(z) which coincide on Rez = 0 are not necessarily equal. For an example, let 
f(z) = exp(iy) and g(z) = exp(x + iy). Thus we need the following exercise from 
Maass [439]. 


Exercise 3.6.6 (Facts About Eigenfunctions of the Laplacian). Suppose that f 
is an eigenfunction of the non-Euclidean Laplacian; i.e., Af = À f, and suppose, in 
addition, that f|x=0 = frlx—o = 0 for all y > 0. Here fy = 0 f/0x. Show that f must 
be identically zero on H. 


Hint. We know that f(z) = Zn>0Cn(y)x". Thus cn(y) satisfies the recursion obtained 
from 


O=Af—Af=¥ (n(n—Vy’en(y)x”? + yeh (y)x" — Aen(y)x”) . 


n>0 


The initial conditions then imply that the c,(y) are identically zero. Note that without 
the hypothesis that 0 f /ðx|x=0 = 0, we could only prove that con(y) = 0. 


In order to obtain the second initial condition for the function f(z) — f(—1/z), 
we need the second L-series corresponding to the Maass waveform f € 


N(SL(2,Z),r(r—1)): 


Ly.(8) = X (2min)ap|n|- 2). (3.114) 
n#0 


This second Dirichlet series is also seen to be necessary because the first series 
Ly, from Eq. (3.112), will vanish identically if f is an odd function of x; i.e., if 
f(x +iy) = —f(—x+ iy). For then a, = —a_,. And Table 3.5 of Sect. 3.5 as well 
as the tables of Booker et al. [48] give some evidence for the conjecture that the 
Dirichlet and Neumann problems have different eigenvalues. A cusp form f has the 
decomposition f = u + v, where u is a solution of the Dirichlet problem and v is 
a solution of the Neumann problem. So, probably, either f = u or f = v; i.e., f is 
either even or odd, as a function of x. If f(z) denotes a Maass waveform solving the 
Dirichlet problem, we see that the L-series L¢(s) = 0, and we must consider Ly, (s) 
to obtain a nonzero series. 

Using the differentiated modular form is an old trick in number theory. One needs 
it to obtain the analytic continuation of Dirichlet L-series: 


LY xn)” 


n>1 


when v(—1) = —y(1) and y is a multiplicative character on the group of units in 
Z/mZ, viewed as a function on Z by writing y (a) = 0 if gcd(a,m) # 1. A reference 
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for the analytic continuation of these L-series is Davenport [120]. Hecke generalized 
this to grossencharacters for algebraic number fields (see Hecke [258, pp. 215-234, 
249-289]). 


Exercise 3.6.7 (Analytic Continuation and Functional Equation for the Second 
L-Series Corresponding to a Maass Waveform). Note that f(—1/z) = f(z) 
implies that f,(—1/z) = z? f(z); so fy has weight 2. Imitate the argument in 
Exercise 3.6.5 to obtain the analytic continuation and functional equation of Ly,. 


The following theorem can now be proved as an Exercise. 


Theorem 3.6.2 (The Maass Correspondence Between Maass Waveforms and 
Dirichlet Series). Suppose that f(z) has the Fourier expansion given by for- 
mula (3.110). Then f € N(SL(2,Z),r(r —1)) is equivalent to the following 
assertions about the Dirichlet series defined in formulas (3.112)-(3.114): 


(1) 
1 1 1 1 
Was) (+) »( —) 


is EBV with functional equation W;(s) = Wy(—s), and 
(2) Wy, has the analogous properties (see Exercise 3.6.7). 


Exercise 3.6.8. Complete the proof of Theorem 3.6.2. 


Maass actually proved a result like Theorem 3.6.2 for functions f(z) satisfying 
more general differential equations and invariant under more general discrete 
subgroups of SL(2,R). 

Expansions of Dirichlet series in series of higher-dimensional incomplete gamma 
functions have been studied by many people. They arise whenever there are several 
gamma factors in the functional equation of the Dirichlet series (see Goldfeld and 
Viola [222], Lavrik [399], and Terras [662, 663)). 


Example 3.6.2. The Dirichlet series corresponding to the nonholomorphic Eisen- 
stein series 


4Y o_x-(n)n"2n- 8? = 46 (s—rt 1S (str). (3.115) 


n>1 


Here again the functional equations can be deduced from that of the Riemann zeta 
function. 


Exercise 3.6.9. Prove the assertions made in the preceding example. 
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3.6.3 Remarks on Extensions to Congruence Subgroups 


One can use slight extensions of Hecke’s theorem to derive the product expansion 
of A and @ in Sect. 3.4 (see Ogg [505, Chap. I, pp. 43-46], [506]). However, many 
important applications require a major extension of the theory, in particular, to 
congruence subgroups such as F(N), which is defined after formula (3.51). Hecke 
and Maass both considered extensions of the theory to congruence subgroups. A 
simple example to explain why Hecke was interested in congruence subgroups can 
be found in Hecke’s lectures, which were given at the Institute for Advanced Study 
at Princeton in 1938 (see Hecke [259, p. 43]). There it is noted that the zeta function 
for the imaginary quadratic field Q(,\/—7) has as its inverse Mellin transform a 
theta function which is a modular form for '9(7) = the congruence subgroup of 
SL(2,Z) consisting of matrices whose lower left entry is divisible by 7. Hecke also 
made connections with the representations of the finite group SL(2,Z/nZ) (see also 
Eichler [151]). 

In 1967, Weil extended Hecke’s correspondence between modular forms and 
Dirichlet series to congruence subgroups in a different way (see Weil [729, Vol. 3, 
pp. 165-172]). It might be interesting to connect the two points of view. In fact, 
Weil extended Hecke’s theory to the subgroup To(N) consisting of all elements 
of SL(2,Z) for which the lower left entry is divisible by N. Here Weil discovered 
that the converse theorem requires more than one functional equation. Because the 
groups involved do not even contain the inversion z++ —1/z, one has to add the 


matrix 
0 —1 
N 07’ 


forming a larger group than lo(N). The resulting group will have more than two 
generators, in general. Weil finds that to make f(z), with Fourier coefficients an, a 
modular form for To(N), one requires the functional equations of the L-functions 


È x(n)ann™ 


n>1 


for sufficiently many primitive characters y of the unit group (Z/mZ)* = 
{bmodm | gcd(b,m) = 1}. Here one considers only m that are relatively prime 
to N. The functional equations of these L-functions involve Gauss sums. Other 
references for Weil’s result are Diamond and Shurman [133], Koblitz [359], Lang 
[391], Ogg [505], Razar [539], and Shimura [589]. 

Weil’s result involves explicit formulas for elements of the congruence subgroup. 
From the point of view of harmonic analysis, one might expect that harmonic 
analysis on '(N)\H should be a combination of analysis on T'(1)\H and that on 
T(1)/T(N). The latter quotient is isomorphic to SL(2,Z/NZ). The principal series 
representations of the finite special linear group are related to characters of the group 
(Z/NZ)* (see Kirillov [353, p. 267]). This sort of thinking would lead one to expect 
a result similar to Weil’s but involving L-functions formed with characters mod N 
itself, rather than modm, with gcd(m,N) = 1 (see Razar [539]). 
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Maass [439] actually obtains the correspondence between nonholomorphic 
modular forms for congruence subgroups and Dirichlet series with functional 
equations involving two gamma factors. This allows Maass to prove the existence 
of cuspidal Maass waveforms for congruence subgroups, distinct from SL(2,Z) 
itself, by using the fact that the functional equations of Hecke L-functions with 
grossencharacters for real quadratic fields involve two gamma factors. These are the 
CM Maass forms of Hejhal and Strombergsson [275]. 

Jacquet and Langlands [323] give an adelic view of Hecke theory. See also the 
collection of papers introducing the Langlands program edited by Bernstein and 
Gelbart [37], as well as Gelbart [200] and Weil [728], plus Borel and Casselman 
[52]. Another reference is Winnie Li [416-419], who has obtained an adelic version 
of Hecke theory closer to that envisioned above from the point of view of harmonic 
analysis. 

Hecke theory has implications for certain conjectures of number theory and 
geometry. Number theorists have been interested in Weil’s result for weight 1 
modular forms because of the Artin conjecture that Artin L-functions associated 
to representations of Galois groups of number fields are entire provided the 
representation is nontrivial and irreducible. References for Artin L-functions include 
Heilbronn’s article in Cassels and Frohlich [79, pp. 204—230], Lang [388], and Stark 
[632]. This conjecture has implications for the Dedekind zeta functions of number 
fields. For example, it implies that whenever a number field F is contained in a larger 
number field K, the quotient of Dedekind zeta functions given by €x(s)/Cr(s) is 
an entire function of s. This result is known if K/F is Galois or normal. Putting 
results of Weil, Langlands, Deligne, and Serre together, one sees that (modulo 
the Artin conjecture) certain forms of weight 1 for T9(N) correspond to Artin L- 
functions for irreducible two-dimensional representations of the Galois group of 
the algebraic closure of the rationals. Because Artin Z-functions have functional 
equations involving many gamma factors, there is a general feeling that these L- 
functions must somehow correspond to automorphic forms for GL(n). This sort of 
feeling is part of the “Langlands philosophy.” References for some of these things 
are Borel and Casselman [52], Serre [577], Tate [658], and Weil [729, Vol. III, 
notes especially]. Langlands [394] and Tunnell [686] have proved some new cases 
of the Artin conjecture for two-dimensional representations of Galois groups using 
base change and twisted versions of the trace formula. 

There is also a geometric conjecture related to Hecke theory. This is the 
conjecture of Taniyama, Shimura, and Weil which says that any elliptic curve 
over the rationals has a zeta function coming from a modular form of weight 2. 
It can be shown that modular forms of weight 2 give rise to L-series which are 
zeta functions of elliptic curves or abelian varieties. The conjecture of Taniyama, 
Shimura, and Weil has been proved and is now called the modularity theorem. As 
part of his proof of Fermat’s last theorem, Andrew Wiles proved the modularity 
theorem for a large set of elliptic curves, with a key ingredient from his joint 
work with Richard Taylor. See [661,740]. The complete modularity theorem was 
proved by Breuil et al. [66]. See Diamond and Shurman for an introduction to 
this subject with countless versions of the modularity theorem stated. See Weil 
[729, Vol. III, notes] for some discussion of the Taniyama, Shimura, Weil conjecture. 
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See Lang’s article [392] about the naming of the conjecture. There are also many 
open questions on the Hasse—Weil zeta functions of algebraic varieties (even elliptic 
curves). References include Katz [344, pp. 300-301] and Swinnerton-Dyer [650]. 


Exercise 3.6.10 (Operators That Raise the Level). 


(a) Suppose that y is a multiplicative character (modm); i.e., 
x :(Z/mZ)* + T, ahomormorphism of multiplicative groups, 


where (Z/mZ)* is the multiplicative group of integers a modm, with a 
relatively prime to m, and T is the multiplicative group of complex numbers 
of norm one. We extend y to a function on Z* by setting y(n) = 0 if n and m 
are not relatively prime. Suppose that f € M(SL(2,Z),k). Define 


LIÐ = X, X(u)exp(—2miuv/m) f(z+v/m). 


u,(modm) 


Show that Ly f is a modular form for the congruence group To (m’). 
Suppose that a is relatively prime to m and a’ is an integer such that aa’ = 
1(modm). Let Ra be a matrix in SL(2,Z) such that 


(b 


ew 


/ 
Ra = k o) (mod N). 
Oa 


Ra is not well defined, but the coset of Ra in F (N)\SL(2, Z) is well defined. Set 
N = m° and show that 


Ly flr, =x% (a)Lyf_ for f as in part (a). 


Here the slash operator is defined by 


az+b 
= dg | =— ), if 
gly(z) = (cz+ d) (22), i 


y= ( > € SL(2,Z), g of weight k. 
C 


(c) Suppose that f(z) is as in part (a) and f(z) = £n>0an exp (27inz). Show that 


Ly f(z) = ¥ any (n) exp(2zinz). 


n>0 


This leads to the twisted L-function studied by Weil. 
(d) Suppose that f(z) is as in part (a). Let y be a multiplicative character modm. 
Define 
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O= È x@)f(et+a/m). 


a(modm) 
gced(a,m)=1 


Show that fy is a modular form of weight k for To(m?) and that, using the 
notation of part (b), fylr, =X 7(a)fy- 
(e) Can you connect the operators in parts (a) and (d)? 
Hint. Parts (a)-(c) can be found in Ogg [505, Chap. IV, pp. 37-38]. Part (d) comes 
from Eichler [151, pp. 147-149]. See also Razar [539]. 
It will help to note that there is a chain of subgroups of T = SL(2,Z) given by 


T(N) CT\(N) CTo(n) CT, 


where T\(N) = {ret 


y= E The quotients T\(N)/T(N) and 


T9(N)/T1(N)are abelian; the first isomorphic to the additive group Z/NZ and the 
second isomorphic to the multiplicative group (Z/NZ)*. 

Note also that [as in part (a) of Exercise 3.6.14 which follows], if y is a matrix 
of determinant N, then transforming f(z) by y using the slash operator 


flre) =N"? (cz+-d) *f((az+b)/(cz+d)) for y= - 1) 


sends a form for SL(2,Z) of weight k to a form for T(N) of weight k. 


Exercises 3.6.10 and 3.6.14 which follows, give a large number of examples of 
modular forms for congruence subgroups. Classically, examples of the form f (mz) 
arose in the theory of equations for f(mz) in terms of f(z), for example, when f(z) 
is a modular function, such as j(z). See Shimura [589, pp. 109-110] for a discussion 
of the modular polynomial, also Weber [724, Vol. III]. 


3.6.4 Hecke Operators on Holomorphic Forms 


Hecke also developed a theory that will determine what modular forms correspond 
to Dirichlet series with Euler products. The first Euler product to arise in number 
theory was that for the Riemann zeta function: 


C(s)= J] (-p*)!, Res>1 (3.116) 


p prime 


(see Exercise 1.4.4 of Sect. 1.4). It is this result that governs the application of the 
Riemann zeta function to the study of the distribution of primes. 
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Around 1935, Hecke used Hecke operators to answer the question: Which 
modular forms correspond to L-series with Euler products? (see Hecke [258, 
pp. 577-707] and [259]). Such operators had already been considered by Hurwitz 
(311, pp. 163-188] and Mordell [476]. It turns out that Hecke operators are related 
to the analogue for p-adic groups of the spherical functions considered in Sect. 3.2 
(see Gelbart [200, pp. 47 ff]). There is now a general theory of Hecke operators for 
a group G and a subgroup H. See Krieg [373] and Shimura [589]. 

As usual, we shall restrict ourselves to SL(2,Z), although Hecke did not. For 
each positive integer n, let M, denote the set of all 2 x 2 matrices of determinant n. 
Set = SL(2,Z). It is easy to see that M, is a disjoint union: 


m= U a (3.117) 


ad=n,d>0, 
b(modd) 


Exercise 3.6.11. Prove formula (3.117). 


Hint. It is clear that given a matrix L € My, there exists a matrix y € SL(2,Z) such 
that yL has zero for its lower left entry. Here we use the fact that a row vector of 
integers is the second row of a matrix in SL(2,Z) if and only if the entries of the 
vector are relatively prime. 


Definition 3.6.1. The Hecke operator T, is defined on f € M(SL(2,Z),k) by 


b 
Ligaen F ats (SP), (3.118) 
ad=n,d>0, 
b(modd) 


Clearly the Hecke operator 7, also depends on k, the weight of the modular form 
f(z) on which T, acts. If f(z) is a modular form of weight k, then the Hecke operator 
is actually independent of the choice of representatives for SL(2,Z)\M,. Thus we 
can consider the sum in Eq. (3.118) to be a sum over SL(2,Z)\M,,. We will see that 
analogous sums arise in the theory of automorphic forms for SL(n, Z) in Volume II 
[667]. Other references for Hecke operators are Apostol [12, pp. 120-138], Kilford 
[350], Koblitz [359], Krieg [373], Lang [391, Chaps.2 and 3], Ogg [505], Serre 
[576], Shimura [589], and Stein [638]. 


Theorem 3.6.3 (Properties of Hecke Operators on Spaces of Holomorphic 
Modular Forms). 


(1) The Hecke operator T, defined by formula (3.118) has the property that if f € 
M(SL(2,Z),k), then Taf E€ M(SL(2,Z),k). Moreover, if f is a cusp form, so 
is Tyf. 

(2) Suppose that f E€ M(SL(2,Z),k) has Fourier expansion 


f(z) = È, anexp(2nimz). 


m>0 
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Then T,, f has the Fourier expansion 


Tf (2) = È, bmexp(2mimz) withbm= Yd 'amnja 
m>0 d | gcd(m,n) 


Here the sum defining bm is over all positive divisors of the greatest common 
divisor of m and n. 
(3) We have the multiplication formula 


TnIm = £ dT nja? 
d | gcd(m,n) 


And the algebra generated by the Hecke operators {T,, n > 1} is commutative. 
(4) Define the Petersson inner product of two cusp forms f,g € S(SL(2,Z),k) by 


= 7N. k—2 
[f8] = f aaa Y dx dy. 


Then [Taf 8] = |f, Tng]. It follows that the Hecke operators can be simultane- 
ously diagonalized on M (SL(2, Z), k). 
(5) Suppose that f E€ M(SL(2,Z),k) with 


f(z) = È, amexp(2rimz), 


m>0 


and a, = 1, satisfies Taf = cnf, for n = 1,2,3,.... Then an = Cy, for all n = 
1,2,3,.... Moreover, the Dirichlet series associated to f has an Euler product 
of the form 


L;(s) = Yann = Il (1 Sapp pr 


n>1 p prime 


Conversely, if L¢(s) = Xn>14nn~* has such an Euler product and is known to 
correspond to a modular form f in M(SL(2,Z),k), then f is an eigenfunction 
for all of the Hecke operators T,,. 

(6) The eigenvalues of the Hecke operators T, are totally real algebraic numbers. 


Exercise 3.6.12. Prove the preceding theorem using the same sorts of arguments 
that we will give in the proof of Theorem 3.6.4, which concerns the analogue 
for Maass waveforms. Details can also be found in the references preceding 
Theorem 3.6.3. The only part of Theorem 3.6.3 that does not have an analogue 
in Theorem 3.6.4 is part (6). To see this, use the fact that the space of modular forms 
of weight k has a basis consisting of elements for which all the Fourier coefficients 
are rational (see Ogg [505, HI-12)). 


Example 3.6.3. The Eisenstein series G; gives an example of an eigenfunction of 
the Hecke operators, since the corresponding Dirichlet series has a Euler product: 
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Le, (8) = E(s)E(s+ 1 — k). 


Example 3.6.4. The cusp form A of weight 12 must also be an eigenfunction 
of the Hecke operators, since it is unique, up to a constant multiple. Thus the 
corresponding Dirichlet series 


La(s) = BS t(n)n * 


n>1 


has an Euler product, a result first proved by Mordell [476] in 1917. Ramanujan 
had stated this result some years before Mordell’s proof, as well as the Ramanujan 
conjecture (see Sect. 3.4). 


Hardy [249, Chap. X] discussed the Ramanujan T in his book on Ramanujan, 
beginning with the remark: “We may seem to be straying into one of the backwaters 
of mathematics, but the genesis of t(n) as a coefficient in so fundamental a function 
compels us to treat it with respect.’ Weil considered this remark in his 1972 
Columbia lectures (see Weil [729, Vol. 3, pp. 280—281]) as “a typical example of the 
deep gulf that separates number-theorists from an analyst like Hardy.” This sort of 
mathematical name-calling is fun, but history clearly has a way of refuting all such 
assertions, including many made by the present author in this and earlier versions 
of this book. 

More examples can be found in Hecke [258,259]. In particular, paper no. 37 in 
Hecke’s collected works [258] gives a list of Dirichlet series Ly, showing that the 
presence of an Euler product is neither necessary nor sufficient for the existence of 
infinitely many zeros on the line of symmetry Res = k/2. 

Goldstein [225] has considered analogues of the Riemann hypothesis and 
Mertens conjecture for the Dirichlet series attached to cusp forms which are 
eigenfunctions for all the Hecke operators. Anderson [6] disproved the Mertens 
conjecture for these Dirichlet series corresponding to such cusp forms when the 
weight is sufficiently large. In 1985, Odlyzko and te Riele disproved the original 
Mertens conjecture (see [504]). 

Bounds for the degree of the number fields containing the eigenvalues of Hecke 
operators are obtained in Rankin and Rushforth [535]. Naomi Jochnowitz [329] 
studied the algebra T“ of Hecke operators acting on holomorphic cusp forms of 
weight k for SL(2, Z). Then T) @Q is a direct product of totally real number fields 
whose dimension over Q equals the dimension of the space S(SL(2,Z),k) over C. 
Moreover, in all known cases the direct product is actually a single number field. For 
example, Hecke [258, p. 671] shows that TC® @Q ~ Q(v 144169) and that TC® 
is the unique suborder of index 24 in the ring of integers of this field. Jochnowitz 
shows that if N is any integer and k is sufficiently large then N divides the index of 
T) in the maximal order of T® QQ. 

The three collections of Math. Reviews in number theory (see [243, 414, 456]) 
contain references to many papers on Hecke operators, etc. For example, there 
are reviews of some of Hecke’s papers (see Leveque [414, Vol. 2, pp. 448 ff]) 
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as well as a formula of Petersson for ¢ (3), involving his inner product and theta 
functions. Of course, one can also visit the Math. Reviews website or just search 
the Internet for Hecke operators. Wohlfahrt [743] generalizes the theory of Hecke 
operators to automorphic forms of real (rather than integral) weight. Rademacher 
[526] shows that Tp- is essentially a Tchebychef polynomial of the second kind 
in T,. Atkin and Lehner [16] complete the theory of Hecke operators for the 
congruence group I'p(N). The problem was to find a satisfactory theory of operators 
for primes dividing the level of the congruence subgroup. Ihara [314] uses the 
Eichler—Selberg trace formula to connect Ramanujan’s conjecture with the Weil 
conjectures on algebraic varieties—a project ultimately completed by Deligne, as 
we noted in Sect. 3.4. 


3.6.5 Hecke Operators for Maass Wave Forms 


The Hecke operators in the nonholomorphic case were developed by Maass 
[437, 439]. 


Definition 3.6.2. For a Maass waveform f € N(SL(2,Z),r(r —1)), and n = 
1,2,3,..., define the Hecke operator T, by 


b 
hf(j=n? X (= i (3.119) 
ad=n,d>0 
b(modd) 


i I 5, : : ; 
The normalizing factor n~2 is not consistent with formula (3.118), since our 
Maass waveforms have weight 0, but it has become standard in the Maass waveform 
literature. 


Theorem 3.6.4 (Properties of Hecke Operators on Maass Waveforms). 


(1) The Hecke operator T,, defined by formula (3.119), has the property that if 
f €N(SL(2,Z),r(r—1)), then Taf € N(SL(2,Z),r(r—1)). Moreover, if f is 
a cusp form, then so is T, f. And the map f > Tf is linear. 

(2) Suppose that f € N(SL(2,Z),r(r—1)) has the Fourier expansion 


f(z) Sey +y T+ py cmV/VK,_ 1 (27|m|y) exp(2zimx), 
m#0 


and 


Taf (2) = by" +b'y' + X, bm/IK,_1 (20|mly) exp(2imx). 
m#0 S 
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Then b= n30] (njc, b= noa (ne, and 


ie Se i 
d2 


d | gcd (m,n) 
d>0 
The sum is over all positive divisors d of the greatest common divisor of m 


and n. 
(3) The algebra generated by the Hecke operators is commutative and the multipli- 


cation formula is 


TrIm= È, Te. 
d | gcd (m,n) 
d>0 


(4) Define the Petersson inner product for Maass cusp forms f,g € SN (SL(2,Z), 
r(r—1)) by 


= ZTN ..—2 
(f,8) = Í ming RO? dx dy. 


Then (Tif, g) = (f, Tag) and thus the Hecke operators can be simultaneously 
diagonalized on the space N (SL(2,Z),r(r—1)). Note that in this case the 
Petersson inner product is the same as the usual L?-inner product. 

(5) Suppose that f € N(SL(2,Z),r(r—1)) has the Fourier expansion 


f(g) sey’ Fey!’ + F, cmVIK,_ 1 (20 |mly) exp(2mimx) 
m£0 


and that f is an eigenfunction of all the Hecke operators; i.e., 
Taf = unf, forn=1,2,3,.... 
Then 
Cn = Uln|Cn/|n| for n + 0; 


and the two associated Dirichlet series have Euler products 


1 . EO 
Ly (s-5) = SY cm|m|* = (c1 +c-1) Il (l—upp ° +p 2s) r 


2 m#0 p prime 


i 
Pi 
AETR 
ta 
+ 
NI = 
Nm 
II 
N 
a 
3 
(e) 
= 
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Conversely, if f E€ N (SL(2,Z),r(r—1)) corresponds to Dirichlet series Lr and 
Ly, with Euler products such as those above, then f is an eigenfunction for all 
the Hecke operators. 


Proof. 


(1) First note that the Hecke operators clearly commute with the non-Euclidean 
Laplace operator. It is easy to see that pf is invariant under SL(2,Z) if f is 
invariant. For if M, denotes the 2 x 2 integer matrices of determinant n, L runs 
through a full set of representatives for SL(2,Z)\Mn, and y € SL(2,Z), then 
Ly also runs through a full set of representatives for SL(2,Z)\M,. Finally, it is 
clear that if f(z) has at most polynomial growth in y, then so does Tp, f, as y 
approaches infinity. 

Suppose that 7, is given by formula (3.119). First note that Im (4) = 


ny az+b\ __ ax+b 
“y and Re (=) = Œ., It follows that 


(2 


xw 


Ta f(z) = = £ (c(nyd~*)’ +c! (nyd~?)'~") 


n=ad,d>0 
b (modd) 
a i x x Cm V MY k = lal e2him(ax+b)/d 
Jn d "2 d 
n=ad,d>0 M40 
b(modd) 


The first sum is 
1 1 
yen! 2 X d! +y! "e'na" X d?! 
O<d |n O<d|n 


/l—r 


n 1 
=cy'n' 2 0)_2;(n) + cy nT o1 (n). 


The second sum simplifies since the sum over b is zero unless d divides m. So 
the second sum is 


E 27 |\m|n PE 
Jy 5 CmK,_1 (15e) e2timnxd : 


d | gcd(m,n) 
Sum over a = n/d instead of d and set v = ma/d (so that m = nva~) to obtain 


by Cm VIK, 2a|vly)e ae 
O<a | ged(n,v) 
veZ 


This gives the stated result. 
(3) The proof is easy if gcd(m,n) = 1. For suppose the matrices 


dı di2 
0 d 
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run through a complete set of representatives for M, modulo SL(2,Z), as in 
formula (3.117) and that the matrices 


C1 C12 

0 C2 
run through a complete set of representatives for Mm modulo SL(2,Z). The 
product is 


dıcı dcij2+c2d)2 
0 doco ` 


If di2 runs through a complete set of representatives modd and c12 runs 
through a complete set of representatives modc2, then dıc12 + cod) runs 
through a complete set of representatives mod d2c2. Why? 

To complete the proof of (3), we want to show that 
T, 


p' 


kTp = Tyeri +T- for k> 1. (3.120) 


Let us set up the following notation. We shall call two matrices equivalent 
and write A ~ B if f(Az) = f (Bz), for all f € N(SL(2,Z),r(r— 1)).!? 

In order to prove Eq. (3.120), we shall multiply matrices in the set of repre- 
sentatives for SL(2,Z)\M, which are summed over in the definition (3.119) of 
the Hecke operator. We have, for e + f = k two kinds of matrices to multiply. 
First we have: 


i oe) K bmod pf 
0 p 0 p’ 
3.121 
_ G Ee RR B C y ( ) 
0 pit! 0 pit! 


Also we obtain: 


p 0 p°? bmodp! 
0 1 0 pf 


_ (p! p(bmodp^) p°? bmodpf 
7 0 pf 0 pt! 
¢ by mod pf! +p! (cı mod 
=i ie 2 cere) (3.122) 
p 
_ 1 cımodp p°? bymodp!—! 
0 1 0 pt! 
p°? bjmodp!! 
hg pit 


'0This amounts to saying that cyA = B for some y € SL(2,Z), c > 0. 
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Equation (3.121) leads to operators in T x+, except that the f+ 1 = 0 term 
is missing. Equation (3.122) gives that term when f = 1. And what remains 
in Eq. (3.122) gives operators from T x-1, each of them taken p times, when 
we take account of the cı mod p. This completes the proof of formula (3.120). 
Why? 

Mathematical induction and Eq. (3.120) imply that 


min(e,f) 


TT = py Te+f-2t- (3.123) 


From this result, part (3) follows easily. 
Note also that Eq. (3.120) implies the following formula for the formal power 
series in the indeterminate X (for prime p): 


Ș TeX = (I T)X +X). (3.124) 


e>0 


To prove this, you must simply multiply both sides by (7 — T,X +X 2) and make 
use of Eq. (3.120). Formula (3.124) will lead to Euler products for L-functions 
corresponding to eigenfunctions of all the Hecke operators. 


Exercise 3.6.13. Fill in the details that were omitted in the proof of part (3); e.g., 
in the proofs of Eqs. (3.123) and (3.124). 


(4) This argument goes back to Petersson [511]. Recall that M, is the set of 2 x 2 
integral matrices of determinant n and I = SL(2, Z). Thus Eq. (3.119) says that 
the Hecke operator T, is a sum over '\M,, and thus 


nf) = > f(Az)g(z)y dx dy. 


For A € Mn, set h(z) = f (Az). Note that (Bz) = h(z), for all B in the congruence 
subgroup 


T(n) = {B € SL(2,Z) =T | B = I(modn)} 


(see Exercise 3.6.14). 
Since a fundamental domain T (n)\H can be taken to be [I : T'(n)] copies of 
I\d (by Exercise 3.3.9 of Sect. 3.3), we have 


z _ 1 ree 
varf) = > Frey E ee (z) p 


AET\Mn 


ain du dv 
FATT Janeen 8 'w) 
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setting w = u + iv = Az and noting that [T : AT (n)A~!] = [I : P'(n)]. Part (b) of 
Exercise 3.6.14 tells us that the last quantity is none other than (f, 7,,g),/n and 
part (4) of Theorem 3.6.4 is proved. 


Exercise 3.6.14. 


(a) Suppose that A is a 2 x 2 integral matrix of determinant n and that f is a modular 
form for T = SL(2,Z). Define h(z) = f(Az). Prove that A(z) is a modular form 
for the congruence subgroup T (n). 

(b) If A runs through a complete set of representatives for [\M,,, where M, is the 
set of integral 2 x 2 matrices of determinant n, IT = SL(2,Z), show that nA~! 
also runs through a complete set of representatives for M,,/T. Then show that 
there is a common set of representatives for M,,/T and T\M,. 


Hint. 


(a) Note that nA~! € Z?*? and that B € T(n) implies that nABA~! = 0(modn). But 
then ABA“! € SL(2,Z). 

(b) For the last part of this exercise see Shimura [589, pp. 53-54]. You need to 
know that the number of left T -cosets of VAT is the same as the number of right 
T—cosets of VAT. It is easy to prove that these numbers are the same in the case 
at hand. 


(5) First suppose that T, f = unf. From part (2) of the theorem, it follows that 
Cn =Uyncy and C-n = UnC]. 


Thus 


1 : 
Lr (s- 5) = ¥ en|n| > = (c1 + 0-1) X, unn™. 


n#0 n>1 


and there is an analogous formula for Ly. The Euler product follows easily from 
formula (3.124). 
Conversely, suppose that Ly and Ly, have Euler products of the indicated 


type; e.g., 


1 ee 
Lr (s-5) = X cmm = (ci +c-1) [[ 0- up™ +p a 


m#0 p prime 


Then the following Dirichlet series has no terms |m|~* such that p divides m: 


isa ayy ae e Candy ey lal 
m#0 m#0 


Plug in m = np to see that cnp — UpCy + Cn/p = 0. Thus 
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Cap + Cn/p = UpCn. 


The left-hand side of this last equality is the nth Fourier coefficient of T, f and 
the right-hand side is the nth Fourier coefficient of up f. Thus 


T f =upf forall primes p. 


By formula (3.120), we know that T,e = TpTye-1 — Tpe-2, if e > 2. Thus we 
can show that f is an eigenfunction for T e by mathematical induction. Then 
TnTn = Tmn for ged(m,n) = 1 completes the proof of the converse in part (5), 


and the proof of the entire theorem. 


Exercise 3.6.15. Show directly that both the holomorphic and the nonholomorphic 
Eisenstein series G,(z) and E,(z) are eigenfunctions of the appropriate Hecke 
operators T}. 


Hint. Since the Hecke operator is a sum over T\My, and the Eisenstein series 
is a sum over T..\T, the problem involves the determination of representatives 
of the quotient T..\M,. That is, you must sum over a,c with gcd(a,c) = t, for t 
dividing n. Then you need to notice that the solutions b,d of ad — bc = n are not 
unique. Elementary number theory says there are t of them modulo VT... For set 
a=ayjt,c=ct, b = bọ + au, d = dọ + ciu, u € Z. Then 


a b\ fa bo 1 u/t 

c d) \e d 01 i 
You can check your calculation by noting that the eigenvalues are essentially the 
coefficients of the Fourier expansions of G;(z) and E;(z). 


We have given a very elementary discussion of Hecke operators. It is possible 
to give an abstract definition of the Hecke algebra associated to a group G and 
subgroup H, assuming that each H-double coset is a finite union of right H-cosets. 
See Krieg [373] and Shimura [589]. This allows one to do Hecke theory for 
congruence subgroups, quaternion groups, and higher rank groups like SL(n, Z). 


3.6.6 Rankin—Selberg Method, Distribution of Horocycles 
in the Fundamental Domain, and Modular Symbols 


In this section we give sketchy discussions of various topics that have been 
important for work on modular forms—the Rankin—Selberg method, the uniform 
distribution of horocycles in the fundamental domain, and modular symbols. 
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Example 3.6.5. (The Rankin—Selberg Method) There is another sort of Dirichlet 
series that can be associated to a modular form. We used this idea already in 
Exercise 3.5.8 of Sect. 3.5. The method goes back to Rankin [534] and Selberg 
[572], who used the method to obtain good estimates (though not so good as the 
Ramanujan—Petersson conjecture) for the Fourier coefficients a, of holomorphic 
cusp forms f(z) of weight k, 


f(z) = Ñ, anexp(2zinz), 


n>1 
by considering 
Lp (s) = X, Jan n. (3.125) 
n>1 
The trick is to note that 
(42)! ST (s+k—1)Ly+(s) = f TIO dy dx. (3.126) 
l 0<y;lx|< 5 


The integral here is over a fundamental domain for T..\H, where F. is the subgroup 
of T = SL(2,Z) fixing ico, defined by formula (3.82). 
Next note that we can write the right-hand side of Eq. (3.126) as 


Í E mfe y? dx dy = f E,(z)y* |f @)| y? dx dy. (3.127) 
TH yefAr TW 


Exercise 3.6.16. Prove formulas (3.126) and (3.127). 


Then the analytic continuation of the Eisenstein series leads to the desired 
estimates for the coefficients an, using methods from analytic number theory. This 
method works for congruence subgroups. However, Selberg [570] gives examples of 
discrete groups I” such that the Eisenstein series has poles in (1 — 6,1). In this same 
paper, Selberg shows how to get around this problem for I” of finite index in T, by 
going from scalar modular forms for I” to vector-valued modular forms for T. 

Part of Deligne’s proof of the Weil conjectures was motivated by the Rankin— 
Selberg argument (see Katz [344]). Many other applications of the Rankin—Selberg 
method have appeared. Some examples are Andrianov [8], Moreno [477], Novod- 
vorsky and Piatetskii-Shapiro [496], Piatetskii-Shapiro [515], Shimura [589-591], 
Stark [626, Part II], and Zagier [750]. See also Iwaniec [319]. 

There are other classical methods of estimating Fourier coefficients of modular 
forms using Poincaré series and Kloosterman sums (see Petersson [512] and Selberg 
[570]). 

Iwaniec (in joint work with Sarnak) obtains a bound on ||u||,, < C4732 for 
cuspidal Maass waveforms u with Au = Au and which are eigenfunctions of all 


the Hecke operators. The standard bound is À +. See Iwaniec [318]. 
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R.A. Smith [608] showed that if f is a cuspidal Maass waveform in the space SM 
(SL(2,Z),r(r—1)), r= 5+ it, t > 0, and f is a common eigenfunction for all the 
Hecke operators, normalized so that its first Fourier coefficient cı has absolute value 
1, then the L?-norm of f on the fundamental domain satisfies the inequality 


EEGI (Ff)? <2IP(+it)| fort > 200, 


Thus the norm of f decreases exponentially as t + œ. 

The following exercise concerns the horocycle {x + iy | x € R}, for fixed y > 0. 
It proves this horocycle to be equidistributed in the fundamental domain for the 
modular group as y — 0+. Stroémbergsson [646] improves on this result. 


Exercise 3.6.17 (Uniform Distribution of Horocycles in the Fundamental 
Domain as the Horocycle Approaches the Real Axis). Let C, C T\H be the 
curve in the fundamental domain for I’ obtained from the horizontal line at height y; 
i.e., the horocycle I’..\R + iy. Then G is a closed curve in the fundamental domain 
having Poincaré length 1/y. Show that C, fills up TH in a very uniform way as 
y —> O°, in the sense that, for any open set U inI’\H, we have 


length(C, NU) area(U) 
length(C,) area(I\H) 


asy 30+. (3.128) 


Here all lengths and areas are non-Euclidean! 


Hints (see Zagier [750]). Use k = 0 and f(z) = yu (z) = the indicator function of 
U, which is defined to be 1 on U and 0 outside U, in formulas (3.126) and (3.127). 
Then 


length(C, NU) 


1 
C(U,y) z f XU (x+ iy)dx = length(C,) 


and 
I(U,s) = / CU, y)y" “dy = | Zu (2)Es(z)y “dx dy. 
y=0 I\H 
Use a Tauberian theorem for Mellin transforms, after noting that 


Res,=1/(U,s) = ay 


Zagier [750] goes on to show that if the error term in Eq. (3.128) is O(y3/4-€) the 
Riemann hypothesis is true.!! 


|The notation “O(y?)” stands for a function which when divided by y” is bounded as y > œ. 
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One can view the integral 


I f(z)zdz 


as a special case of the period integral (for s € Z) 


[toa 


with a,b € Pr = {cusps of the discrete group I}. This observation allows Manin 
[448—451] to develop a systematic way of calculating these integrals or modular 
symbols, making use of facts about cohomology, Farey fractions, and continued 
fractions. Write {a,b}; for the modular symbol or the homology class determined 
by the geodesic joining the cusps a and b in Pr. Then one has 


{ya,yb}r={a,b}r forall yer, 
and 
{a,,ao}r+---+{ay,a;}p=0 forall sequences of cusps a1,..., ap. 


The second sort of relation brings in the Farey fractions. One can also transfer the 
action of the Hecke operators to the homology. As a result (after some modification 
of the theory when the weight of the cusp form is not 2), one can prove, for example, 
the following fact about ratios of L-series corresponding to the cusp form A(z) of 
weight 12 (the discriminant defined in Exercise 3.6.4 of Sect. 3.4). 


—691 -691 
A(z)d A(z)z7d fra A(z)z4d 1: 
(i [ra j )éae) = ( 22345 ` 23375. =] 


Such work was begun by Eichler [153] and Shimura [592]. A reference is Lang 
[391]. It is proved, for example, that ratios of certain L-values in the critical strip 
lie in the field generated by the Fourier coefficients of the corresponding modular 
form. This fits into a general philosophy of Deligne (see Deligne’s article in Borel 
and Casselman [52, Vol. 2, pp. 313-346] and Zagier [752, pp. 118—120]). Another 
paper on this subject is Razar [538]. 

Voronin [709] considers a similar sort of Dirichlet series to that in for- 
mula (3.105). Suppose that f(z) = A(z), as in formula (3.68). Let C be the vertical 
line in H starting at the point a on the real axis. Thus C is a geodesic. Consider the 
integral I(s) = J.(—i(z—a))*"|f(z)dz. It is easy to see that for Re s > 12, I(s) = 
i(27) T (s)®(s), where 


= X t(n)exp(2rina)n™, Re s > 12. 


n>1 
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Here t(n) denotes the Ramanujan tau-function defined by formula (3.66). Voronin 
manages to obtain an analytic continuation of ®(s) as a meromorphic function in 
the whole complex s-plane. 


Exercise 3.6.18 (Fourier Expansions of Other Eisenstein Series). Compute the 
Fourier expansions for some other Eisenstein series; e.g., that for the leaky torus in 
Exercise 3.6.14 of Sect. 3.3 or for some congruence subgroup. 


If you have access to a computer, the following exercise should be possible. It 
illuminates what is happening in Exercise 3.6.17 above, as well as in Fig. 3.39. 


Exercise 3.6.19 (H. Stark) (Computer-Generated Movie of a Non-Euclidean 
Shock Wave). Consider the following points on a horocycle: zj(y) =iy+j/N, j= 
1,2,..., N, holding N fixed. Find y € SL(2,Z) such that yz;(y) = w;(y) lies in the 
standard fundamental domain for SL(2,Z)\H (as in Fig. 3.14). Start with y = 2. Let 
y approach zero from above and watch what happens to the points w: (y), ..., wn (y). 
At first, you see points on a horizontal line segment of length 1 and height 2. 
As the segment moves down (y — 0+), the individual points zj(y) move along 
geodesics. You can consider the points as forming a non-Euclidean shock wave. The 
wave reflects from the boundaries of the fundamental domain as y passes through 
y = 1. Then for smaller values of y, more reflections occur and the picture begins 
to look quite chaotic. The maximum amount of chaos appears to occur around 
y = 1/N. After that the picture begins to become less random. Ultimately the points 
form various horizontal line segments which move up to the cusp at infinity. For 
example, if N = 51, there are four line segments corresponding to points with 
reduced denominators 1,3,17,51. This is related to the rational cusps approached 
by the z;(y), as y + 0. This movie can be generated on your favorite computer. Stark 
created this movie on his Atari ST which is still operating. 
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The Selberg theory allows us to actually perform calculations on noncocompact but finite 
volume universes which are of interest to those working in general relativity. 


—From Hurt [308, p. xiv]. 


3.7.1 Introduction 


In this section we seek to describe the non-Euclidean analogue of Fourier series 
for L?(SL(2,Z)\H) as well as two related analogues of the Poisson summation 
formula. See Sects. 1.3 and 1.4 for the Euclidean versions and compare them with 
the non-Euclidean results given in Theorems 3.7.1, 3.7.3, and 3.7.4 which follow. 
We shall discuss various applications in number theory, analysis, and geometry. 
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Number-theoretic applications include a non-Euclidean analogue of the circle 
problem on the asymptotics of the number of lattice points inside a circle of radius 
x, as x approaches infinity (cf. Theorem 1.3.5 of Sect. 1.3 and formula (3.136) 
that follows). We will also consider the asymptotics of units in real quadratic 
fields (in Theorem 3.7.6). There are other number-theoretic applications that will 
not be discussed here; e.g., the Eichler-Selberg trace formula for the trace of 
a Hecke operator, the Riemann hypothesis for Selberg’s zeta function, formulas 
for dimensions of spaces of holomorphic modular forms, and Langlands’ work 
on the Artin conjecture for Artin L-functions using twisted trace formulas. More 
analytic applications include the asymptotics of the eigenvalues of the Laplacian 
on L?(SL(2,Z)\H), which is another non-Euclidean version of Theorem 1.3.5 
of Sect. 1.3 (see Theorem 3.7.5 which follows), and the solution of the non- 
Euclidean heat equation on SL(2,Z)\H (Exercise 3.7.16). The Selberg trace formula 
(Theorem 3.7.4) provides a duality between the eigenvalues of the Laplacian on 
M =T\H and closed geodesics in M (if the contribution from the elliptic and 
parabolic elements of T is small). Thus one obtains geometric information about M. 
The last remarks are related to the question: “Can one hear the shape of a 
drum?” asked by Kac in [332] and discussed in Sect. 1.3. As we saw in that 
section, one can consider the problem for a plane membrane M, held fixed along 
its boundary C. If the membrane is set in motion and u(z,t) denotes the vertical 
displacement above z € M at time t, then u satisfies the wave equation 


A,u(z,t) = uy (Z,t), for z in M, 
u(z,t) = 0, for z on the boundary C. 


The solutions will be superpositions of normal modes exp(i@nt)vn(z). Here 
vn(z), n = 1,2, 3, ..., denotes a complete orthonormal set of eigenfunctions of 
the Laplacian on M which vanish on the boundary C, with eigenvalue -o2 =A,, 
where the spectrum of A is {A,, n = 1,2,3, ...}. This comes from Theorem 1.3.7 
of Sect. 1.3 on the spectra of compact self-adjoint operators. One hears the normal 
modes or pure tones @, corresponding to the eigenvalues of the Laplacian. 

The question is then, “What can you say about the geometry of M if you 
know the spectrum A,, n = 1,2,3, ... ?” It is, of course, possible to pose the 
analogous question for any Riemannian manifold. Milnor showed that there are two 
noncongruent lattices L in R!° such that the corresponding tori R!°/L have the same 
spectra. Thus you cannot hear everything about a torus. Marie-France Vignéras 
[701] found analogous examples '\H, with T's coming from quaternion algebras. 
In 1985, Sunada found a method which uses group representations to construct 
examples of drums that cannot be heard (see [649]). Buser [74] found such examples 
of isospectral manifolds coming out of graph-theoretic analogues, which themselves 
originate in examples of nonisomorphic algebraic number fields with the same 
Dedekind zeta function. But these Buser manifolds were not flat and so did not 
answer the Kac question. In 1992, Carolyn Gordon et al. [227] showed how to 
flatten the Buser examples and thus that there are (nonconvex) planar drums with 
shapes that cannot be heard. See my books [668,671] for more information on the 
graph theory analogues. 
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Although you cannot expect to hear everything about M from the eigenvalues of 
the Laplacian on M, there are still many geometric quantities that can be determined 
from the spectrum. For example, Weyl showed (in the Euclidean case) that you 
can hear the area of M. A non-Euclidean analogue of this result is contained in 
Theorem 3.7.5 which follows. 

One can also consider the problem of material diffusing through a plane region 
M with boundary C, starting at a position z, in such a way that it is absorbed upon 
hitting the boundary curve C. The density of matter py(z,w;t) at position w will 
satisfy the heat or diffusion equation 


d 

3, Pm wt) = Awpa (2, w;t), 
pam (z,w;t) +0 as w approaches the boundary, 
pm(z,w;t) > 6(z—-w) ast>0+. 


Here 6(x) denotes the Dirac delta distribution centered at the origin in the plane. If 
Vv, and A, are as above, then one can express py as follows: 


Pm(z,w3t) = 5 exp(Ànt)vn(z)vn(w). 
n>1 
As t — 0+, it is reasonable to expect that “particles of the diffusing stuff will not 
have had enough time to have felt the influence of the boundary C. As particles begin 
to diffuse they may not be aware, so to speak, of the disaster that awaits them when 
they reach the boundary,” according to Kac [330, p. 481]. Thus one expects that 


1 lz- wll? 
4t 


| as t — 0+ 


(see Exercise 1.2.13 of Sect. 1.2). It will follow upon integration that 


X$ exp(—Ant) ~ area (M)/47t, ast + 0+. 


n>1 


This is the Laplace transform of the result of Weyl mentioned earlier, which allows 
one to hear the area of the drum. We consider a non-Euclidean analogue of this 
result in the proof of Theorem 3.7.5 below. There has been much work on further 
terms in the asymptotic expansion given above (see Berger [36] and Molchanov 
[473] for references). Kac [loc. cit.] provides connections between the asymptotic 
result above and an asymptotic formula relating quantum-statistical mechanics and 
classical mechanics, as well as connections with the Wiener integral. This latter 
integral interprets diffusion in terms of a measure defined on the space of all 
continuous curves emanating from the origin. The measure is defined to agree with 
the measure coming from the Einstein—Smoluchowski theory of Brownian motion. 

These considerations have many connections with physics, as Molchanov [473, 
p. 7] describes in the following quotation: “In the physics literature the idea of 
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‘integrating along trajectories’ as a method of studying spectra has found broad 
application in quantum mechanics, mainly in the work of the Feynman school; 
the monograph (Feynman [178]) is devoted to the theory of Feynman integrals. 
It concerns ‘measures’ in a space of trajectories, which are constructed from a 
fundamental solution of the Schrédinger wave equation. A mathematically rigorous 
foundation for the Feynman theory, at least in certain of its facets, is even now still 
wanting (see the discussion in [Daletskii [116]).’ Another reference on Feynman 
integrals is the paper of DeWitt [131]. Hurt [308, 309], Gutzwiller [239-241], 
Dowker [138, 139] give some insight into the connections between the Selberg trace 
formula and the Feynman integral picture, and more. 

Most of the results on harmonic analysis on '\H, for Fuchsian groups I of the 
first kind, are due to Atle Selberg, who lectured on them at Gottingen in 1954. The 
manuscript of the part of the lectures concerning compact fundamental domains 
appeared to have been lost for many years thanks to the absence of Xerox machines. 
However, the second part of the lectures concerning noncompact fundamental 
domains such as that for SL(2,Z) was to be found in the Gottingen library (this 
is Selberg [573]) and now is in Selberg’s collected works [574, Vol. I, pp. 626-674] 
along with Selberg’s interesting commentary on the lectures. Many more things are 
in the Selberg archive section of the Institute for Advanced Study website 


http://publications.ias.edu/selberg. 


W. Roelcke was working independently on the subject around the same time 
(see Roelcke [545]). Previously Delsarte worked out some of the theory for 
the compact fundamental domains (see Delsarte [129, Tome II, pp. 599-601, 
829-845]), motivated by non-Euclidean lattice point problems. 

In this section (and indeed this chapter), we have chosen to consider only the 
example of I = SL(2,Z). In one respect, this is a nice example, for one can make 
everything in the trace formula very concrete. However, in another respect, this 
example, is a difficult one, for it produces parabolic terms in the trace formula— 
parabolic terms that are rather complicated. Even the elliptic terms are not trivial. 
So the reader might want to replace [ by some nice arithmetic group with compact 
fundamental domain and no elliptic fixed points. Examples of such groups were 
discussed in Sect. 3.3. 

General references for this section include Borel and Casselman [52], Borel and 
Mostow [54], Bump’s introduction to the trace formula in Bernstein and Gelbart 
[37, pp. 153-196], Bump [71], Chavel [84], Duistermaat et al. [143], Elstrodt [155, 
156], Elstrodt et al. [157—160], Faddeev [171], Fay [176], Gangolli [191], Gangolli 
and Warner [195, 196], Gelfand et al. [203], Godement [213-216], Goldfeld and 
Hundley [219], Goldfeld and Husemoller [220], Hejhal [261-271], Hurt [308], 
Iwaniec [318,319], Kubota [375], Lang [389], Lapid [395], Lax and Phillips [400], 
Marklof’s introduction to Selberg’s trace formula in Bolte and Steiner [47, pp. 83- 
120], McKean [461], Roelcke [544-546], Sarnak [555-557], Selberg [569, 571, 
573,574], Subia [647], Tamagawa [656], Venkov [693,695], Venkov et al. [696], 
Marie-France Vignéras [700], Voros [710], Dorothy Wallace [711], and Warner 
[718]. The Sarnak homepage at IAS also has enough ideas to keep you busy for a 
very long time. And don’t forget the Langlands archives at IAS. 


320 3 The Poincaré Upper Half-Plane 
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We shall begin by discussing the spectral resolution of the non-Euclidean Laplacian 
on the fundamental domain. This consists of an expansion of an “arbitrary” function 
f : SL(2,Z)\H — C in eigenfunctions of A. Such an expansion is a non-Euclidean 
analogue of Fourier series, but it involves a mixture of series and integrals. 
Because the fundamental domain is not compact, the eigenvalue problem for A on 
SL(2,Z)\H is said to be singular. There may be continuous or discrete spectra 
(or both) in such cases. For example, recall the eigenvalue problem arising from 
the quantum mechanics of the hydrogen atom (see formula (2.14) of Sect. 2.1). 
See Stakgold [618,619] and Titchmarsh [679] for other examples of eigenvalue 
problems with mixtures of continuous and discrete spectra. 

The reader should recall from Sect. 3.5 the basic facts about the spectrum of the 
non-Euclidean Laplacian on SL(2,Z)\H. Let us now collect the basic ingredients 
for our cake. 

The Various Parts of the Spectrum of the Non-Euclidean Laplacian on 
SL(2,Z)\H: 


(a) The discrete spectrum of constants and cusp forms, with orthonormal basis 


{vn}n>0; Yo = re Vn = cusp form, n> 1. 
Avn = Sn (Sn = 1)vn, Sn E 5 +iR, (3.129) 
Vn(z) ~ 0, asz > œ, forn > 1. 


(b) The continuous spectrum of Eisenstein series, with the Eisenstein series 
defined in formula (3.81), 


{E,|s=4+it, t €R}, 
AE, =s(s— 1)Es, 3.130) 
Es(z) wy t E T, asz—=> o, j 


where A(s) = nT (s)¢ (2s). 


Exercise 3.7.1. Show that the three spaces of Eisenstein series, constants, and cusp 
forms, respectively, are mutually orthonormal. 


Note. Throughout this section the inner product on Z? (SL(2, Z)\ĦH) is 
H= f FOD dx ay. 6-131) 
SL(2,Z)\H 


The Eisenstein series are not in L?(SL(2,Z)\H) by Exercise 3.5.11 of Sect. 3.5, 
but they are in L'(SL(2,Z)\H), when s is the critical strip. See also Exercise 3.5.12 
of Sect. 3.5 and Selberg [571, pp. 183-184] and [574, Vol. I, pp. 650 and 672] for a 
way of truncating Eisenstein series in order to make sense of inner product formulas 
involving them. 
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Our first goal is to prove the following theorem. 


Theorem 3.7.1 (The Roelcke-Selberg Spectral Resolution of the Laplacian A 
on L? (SL(2, Z)\H)). Any f in L?(SL(2,Z)\H) has the non-Euclidean Fourier 
expansion 


1 
FO = LU vam z f OEEO. 


es=5 


Here we use the notation of Eqs. (3.129), (3.130), and (3.131). 


Before proving Theorem 3.7.1, note that the spectral measure (4zi)~'ds for 
the continuous part of the decomposition is computed from the asymptotics and 
functional equation of the Eisenstein series in accordance with the principle that 
worked in Theorems 3.2.1 and 3.2.2 of Sect. 3.2. For we know from Exercise 3.5.2 
of Sect. 3.5 that 


E;(z)~y*, asy—>œ, fors fixed with Res > 1. 


And formula (3.68) shows that E;(z) has a functional equation relating s to 1 — s. 
Thus we find that the spectral measure should be 


spectral measure for usual Mellin inversion in Sect.1.4 1 


number of functional equations 4ni 


Lemma 3.7.1, which follows, gives a rigorous derivation of this spectral measure. 
Since E,(z) has a pole at s = 1, there is more happening in Theorem 3.7.1 than 
we saw in Theorems 3.2.1 and 3.2.2 of Sect.3.2. And we will find that Cauchy’s 
residue theorem throws Res,—1£;(z) into the discrete part of the spectrum of A on 
SL(2,Z)\H. 

The proof that follows is decomposed into a sequence of lemmas that will only 
end with Theorem 3.7.2 concerning the discrete part of the spectrum. The method 
is that of R. Godement (see Borel and Mostow [54, pp. 211—234]). Other references 
for this discussion are Kubota [375, Chap. 5, including footnotes and a remark after 
Theorem 7.5.5] and Lang [389, Chap. 13]. 

The main idea of the proof is to reduce to the ordinary Mellin inversion formula 
by making use of a series we shall call the “incomplete theta series.” 


Definition 3.7.1. The incomplete theta series or Poincaré series Ty(z) for a 
sufficiently nice function y : Rt — C is given by 


Ty(z)= ¥ w(im(yz)) for ze H. (3.132) 


yeTo\P 
Here To = 4 + En 
01 


ne z} as in formula (3.82). 
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The following exercise gives some explanation of the word “nice” in the 
preceding definition. 


Exercise 3.7.2. 


(a) Show that if y has compact support, then the series (3.132) has only finitely 
many terms. 

(b) Show that if there exists a k > 1 such that |y(y)| < y‘, for all y > 0, then 
series (3.132) converges. 

(c) Give conditions on y that suffice to put T y in L?(SL(2,Z)\H). 


Exercise 3.7.3 (An Adjoint for the Operator 7). 
(a) Define the constant coefficient operator on f € L?(SL(2,Z)\H) by 


cro) = [feet iy)dx 


Show that this operator pulls out the constant term in the Fourier expansion 


That is, show that (Cf) (y) = co(y). 
(b) Show that if T denotes the incomplete theta series (3.132), and the inner product 
is as in formula (3.131) 


Tyf = | VOCO yas, 


assuming f is real-valued. 


Hint. Start with 


Ja p z ae 


This same idea was used in formulas (3.126) and (3.127) during the discussion of 
the Rankin—Selberg method. 


Lang [389, pp. 241-243] notes that the adjointness relation in the last exercise 
happens for any pair of closed subgroups T,N of a Lie group (or just a locally 
compact topological group) G. Here 


rf) 


And N\G = R? — 0 via the identification Ng — (0,1)g, g € G. The G-invariant 
measure on N\G is Lebesgue measure on R. Lang shows that the same calculations 
that give part (b) of Exercise 3.7.3 say that 


xE R}, T = SL(2,Z), G=SL(2,R). 
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Ty.Ane =f TOTO [wo f pp 0n ay 
= (V,CF)M G: 


defining 


TWO) = f Yar and cf) =f. f0)An 


Lemma 3.7.1 (Properties of Incomplete Theta Series). Let 09 denote the closed 
subspace of L?(SL(2,Z)\H) generated by Tw such that My(—1) = (Tw,1) = 0, 
where Ty is the incomplete theta series (3.132) and y is smooth with compact 
support on R* and Mellin transform 


= [ yyy dy. 


Define 
Lo(SL(2,Z)\H) = { f € L’(SL(2,Z)\H) | (Cf)(y) =0 for almost all y > 0}. 


Here C(y) is the constant term operator in Exercise 3.7.3(a). Then we have the 
following list of assertions. 


(1) There is an orthogonal decomposition 
L? (SL(2,Z)\H) = L2(SL(2,Z)\H) ® CO &. 
(2) For c > 1 and E,(z) the Eisenstein series defined by formula (3.81), 


1 


Ty(z) = ani 


f __ MYSE, (z)ds. 
(3) If p(s) =A(1— s)/A(s) and A(s) = n™T(s)¢ (2s), then 


(Ty, Es) =My(s—1)+ ọ(5)My(-5). 


(4) Set vè = (area(SL(2,Z)\H))~! = Res,—1E;(z). Then vo is in the discrete part of 
the spectral decomposition of A on SL(2,Z)\H, and 


T w(z) = (TwW,vo)vo + wl = (Ty, Es )Es(z)ds. 
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Proof. 


(1) From Exercise 3.7.3, we have 
(Tys) =f  wontCayoWv ay 


where C(g) is the constant term in the Fourier expansion of g. It follows that 
the orthogonal complement of the L?-space spanned by the Tys is the space 
13 (SL(2,Z)\H). 

Next use Exercise 3.7.3 to see that 


(Ty, 1) =My(-1). 


This means that we can decompose the L?-span of the incomplete theta series 
into an orthogonal direct sum of the constants and 09. To see why the constants 
come out discretely in the spectral decomposition, see the argument in the proof 
of part (4). 

From the Mellin inversion formula of Sect. 1.4 it follows that 


1 S 
vo) = z h, Mu(—sjy'ds. 


(2 


xw 


In order to prove (2), use the definitions of T y in Eq. (3.132) and the Eisenstein 
series E, in formula (3.81) and sum this Mellin inversion formula over T..\T 
for Res = c > 1. This last condition is necessary for absolute convergence. 

(3) This follows easily from Exercise 3.7.3 and the Fourier expansion of E, given 
in Exercise 3.5.4 of Sect. 3.5. 

(4) For c > 1, we have from part (2), 


2Ty(z) = = a __ 2M W(-s)Es(e)s 


To use the residue theorem and move the integral over to the line Res = 7 we 
must find the poles of the integrand in 5 < Res < c. Now My(—s) is entire, 
since y has compact support. Thus the only pole of the integrand in the region 
is at s = 1 with residue 


2My(—1)Ress—1E;(z) = 2(T y, 1)(vol(P\H))! 


So we have 


2T (z) =2(TY,vo)vo + ae —s)E,(z)ds. 


Res= 


Now we use the functional equation of E, (noting that |@(s)| = 1, for Res = 5) 
to write 
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f 2My(—s)Es(z)ds 
Res=5 


= (My(—s)Es(z) + 9(s)Mw(—s)E1_s(z)) ds 


ER | 
Res=5 


= (My(—s)Es(z) + (1 —s)My(-(1 — 8) )Es(z)) ds. 


MOE | 
Res=5 


The last equality follows from replacing s by 1 — s in the second part of the integrand. 
Now note that when Res = 5 we have —s = 5— 1 and 1 — s = 5. Thus, it follows 
from part (3) that 


1 
2Ty(s) = 2(Ty,v0vo-+ — f (Ty, E,)Es(2)ds, 
2ri Res=4 


as was to be proved. 


Exercise 3.7.4. Fill in the details in the proofs of parts (2)-(4) of Lemma 3.7.1 
above. For part (4), note that you need bounds on the integrand to move the lines of 
integration. The argument is similar to that used for ordinary Mellin transforms. 


Exercise 3.7.5 (The Size of E,(z) as t Approaches Infinity and the Residue at 
s=1). 


(a) Show that E;(z) = o(t2*®), as t approaches infinity (for fixed z), if s = o + it 
and © > 5: 


Hint. Use the Phragmen-Lindelof theorem (see Lang [388, pp. 262-267] or 
Titchmarsh [680]). Note that Es(z) = O(exp(t*)) for some a > 0, using the 
incomplete gamma expansion (Theorem 1.4.1 of Sect. 1.4) or the Fourier series 
(Exercise 3.5.4 of Sect. 3.5). Deduce that for Re s > 1, E;(z) = O(1), using 
the definition of Es(z) as a Dirichlet series. Obtain the estimate for Es(z) when 
Re s > 0, using the functional equation 


A(1—s) A(1—s) 


E;(z) = KO E,_s(z) = Kea) 
2 


E\_;(z), with A(s) = 2 °T(s)¢ (2s), 


and using Stirling’s formula for T(s). One could actually improve the estimate, 
making use of Hadamard’s three lines theorem, and one could do even better if 
one knew the Riemann hypothesis. 


(b) Show that Res,—)£,(z) = (area(SL(2,Z)\H))~!. You could use the Fourier 
expansion of E, to do this, for example, plus the fact that €(2) = x*/6. Or 
you could use the incomplete gamma expansion of Epstein’s zeta function 
(Theorem 1.4.1 of Sect. 1.4). This is a general property of Eisenstein series as 
we shall see in Volume II or [667] (see also Sarnak [558] and Siegel [600, 
Vol. I, pp. 459-468; Vol. II, pp. 328—333]). 
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Lemma 3.7.1 completes the portion of the proof of Theorem 3.7.1 coming from 
the continuous spectrum and the one-dimensional space of constants in the discrete 
spectrum. This latter part of the discrete spectrum is often called the “residual 
spectrum” since it comes from a residue of the Eisenstein series. This use of the 
words “residual spectrum” conflicts with that of functional analysis. There are 
groups T° such that the Eisenstein series have arbitrarily many poles on the real 
line (see Selberg [571, p. 184]). Next we must deal with the rest of the discrete 
spectrum of A. We will show that L3(SL(2,Z)\H) lies in the discrete spectrum of A 
(see Theorem 3.7.2 below). 

At this point one might wonder why L}(SL(2,Z)\H) is not the zero space. This 
is equivalent to showing that nonzero cusp forms exist. One can deduce this from the 
Selberg trace formula itself. It is fairly easy to show that there are odd cusp forms 
for SL(2,Z), as the following exercise shows. See our discussion of the Roelcke— 
Selberg conjecture in Sect. 3.5.5. 


Exercise 3.7.6. 


(a) Show that L3(SL(2,Z)\H) contains a nonzero element. 
(b) What goes wrong with trying to find an even element of L5(SL(2,Z)\H) by 
starting with a classical holomorphic cusp form of weight k and forming g(z) = 


FPI 
Hint on Part (a) (H. Stark). Take an odd function g(x) of period 1; e.g., 


sin(2anx), x€ R, n € Z. Then form a function f(z) on the standard fundamental 
domain D for SL(2, Z) pictured in Fig. 3.14 by setting 


fai g(x) for z = x+ iy, with y> 2, 
= L0 forz=x+iy, with y <2. 


We could actually do this smoothly. Then extend f(z) to a function F (z) by writing 
F(z) = f(yz), if y € SL(2,Z) is such that yz € D. Note that the lack of uniqueness 
of y for z on the boundary of D does not matter since f is the same at equivalent 
boundary points. You need to show that the function F (z) is an odd function of x. 
This comes from the fact that f(z) was an odd function of x and the identity 


ENE _ fab P a —b 
¥(-zZ)=—-Yz fory= (24) esez and 7 = ( ar 


—c 


In order to study the discrete spectrum of A it is natural to attempt to replace A 
with integral operators. Here we shall use convolution operators. It is also possible 
to study the resolvent: 


Aar = Gy (z,w)f aes (3.133) 


wel \H 
w=u-+iv 
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The kernel Gy (z,w) in formula (3.133) is called the Green’s function for A — À. 
We studied the Euclidean analogue of this Green’s function in formula (1.16). 
References for the non-Euclidean Green’s function are Elstrodt [155,156], Faddeev 
[171], Fay [176], Hejhal [261], Lang [389], Neuenhoffer [494], and Roelcke 
[544, 545]. The poles in A of the Green’s function G; are the eigenvalues of A on 
L’(SL(2,Z)\H); i.e., the discrete spectrum. Branching in A for G} gives rise to the 
continuous spectrum and the Kodaira—Titchmarsh formula allows the computation 
of the spectral measure from the jump across the real axis [see formula (2.26)]. See 
Example 3.7.2 and Exercises 3.7.10—3.7.15 of this section for more information on 
Green’s functions. 

In order to define convolution as in Sect. 3.2 we must think of functions on H = 
G/K, with G = SL(2,R) and K = SO(2), as functions on G by writing f(a) = f(ai). 
Here ai means the point in H to which a € SL(2,R) maps i = /—1 by fractional 
linear transformation (see Exercise 3.1.8 of Sect.3.1). Suppose then that we have 
two integrable functions f,g : H — C. Then define the convolution of f and g by 


Lef(a) = (f *a)(a )= frost (3.134) 


Here db denotes left = right Haar measure on G (see Sect. 2.1 and Exercise 3.2.19 of 
Sect. 3.2). The measure db is unique up to a positive constant and is characterized 
by its invariance under the group action. General results on Haar measure can be 
found in Helgason [277] and Lang [387]. We do not need any explicit formula for 
db, just its existence. 


Note. There is a slightly confusing aspect of the identification f(ai) = f(a), for 
a € SL(2,R). For if we adjust the Haar measure da so that 


[ faiaa= f f(x+iy) y 7dx dy, 
G H 


we discover quickly that the integral on the left is invariant under the substitution of 
a`! for a, but not the integral on the right! For 


x+y = teyra f 

X“Vors o yey? 
1x y2 0 y! 0 1 —x =i 
01 0 yl? ee yJ oI] y’ 


Let u + iv = (i — x) /y. The Jacobian of this change of variables is 


and 
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Therefore 


LG G he) pid= [farida 


This is a reflection of the non-unimodularity of the group of upper triangular 
matrices with determinant 1 and positive diagonal. The moral is that we should 
be somewhat careful about our identifications. 

As an example, let us compute Ly f (i), for f(z) = ps(z) = (Imz)*. The result is 


LePps(i ) = f pb (bi)g Idb = Jr *)y dx dy 
emia 


where ĝ(s) is the Helgason transform of g defined in formula (3.32) of Sect. 3.2, 
assuming that g is in fact rotation-invariant. 


Lemma 3.7.2 (Properties of Convolution Operators). Suppose throughout that 
g: H — Cis infinitely differentiable with compact support. 


(1) The operator Lz defined by Eq. (3.134) commutes with the left action of x € G 
on functions. Thus Lg is a G-invariant integral operator such that 


Lg : L’(SL(2,Z)\H) —> C“(SL(2,Z)\H). 


(2) If g(a) = g(a!) for all a in G and if g is real-valued, then Lg is a self- 
adjoint operator on L?(SL(2,Z)\H)); i.e., (Lg f,h) = (f,Lgh), using the inner 
product (3.131). 

(3) Leenf = LgeLn f- 

(4) The operators Lọ commute if g is K-bi-invariant, K = SO(2); that is if g : 
K\G/K > C. 

(5) If g(a) = g(a™!) for all a € G, then A(Lg) = LA and AL; = Lag. 

Proof. 


(1) For x in G, set f*(y) = f(xy). Then, using the left invariance of Haar measure 


db, we obtain 
= | Flsb)e(-"a) db 


= | Feet (b-!xa) db = (Le f)*(a). 
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(2) To see this, write 


(Lef,h) =) exit ® g(b-!a) db h(a) da 


= i! Po [ gual © Z 80 a) db f(a) aa 


yer 


= f fg alifa) da f(b) db 
G/K aK? 
= (f,Lgh), since g(a) = g(a!) and g is real-valued. 


(3) This follows from the associative law for the convolution of L! functions. 
(4) We leave this as an Exercise similar to Exercise 3.2.19. 
(5) See Exercise 3.7.7. 


Exercise 3.7.7. Prove part (5) of Lemma 3.7.2. Do both formulas require g(a) = 
g(a_!)? If g is K-bi-invariant, does it follow that g(a) = g(a7!)? 


Lemma 3.7.3 (The Connection between the Eigenvalues of Convolution 
Operators and Eigenvalues of the Laplacian. The Selberg Transform and 
the Helgason Transform). 


(a) Suppose that Lif =Agf, Ag € C, for all g € CZ(K\G/K). Then Af = uf, 
and the transform u — Ag, which is often called the Selberg transform, is 
essentially the Helgason transform defined by formula (3.32) for rotation- 
invariant functions g. 

(b) More precisely, suppose that g : K\G/K — C is infinitely differentiable with 
compact support. Let ọ be any eigenfunction of the non-Euclidean Laplacian 
with Ad = s(s—1)o. For example, ọ could be a cusp form, or y’, or the 
spherical function 


Moe | (Im(kz))°dk = P,_;(coshr), if c=ke ic H,kEK, 
keK 


with P;,—ı = the Legendre function of Exercises 3.2.9 and 3.2.10 of Sect. 3.2. 
Then Ag = s(s— 1) implies that 
md 
Q 


o) = Eo) =a1-9= f ey dray 


= the Helgason transform of g; 


i.e., if the eigenvalue of A corresponding to @ is u = s(s— 1), then the eigenvalue 
of Lg corresponding to @ is Ag = ê(1 — 5). 


Proof. 
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(a) Suppose that f * g =A, f. Then 


Af =A(f* 8) = f* (Ag) = agf implies Af = uf, 


with u = Aag/Ag. Note that if g runs through a Dirac family at the identity, we 
can assume that A, is nonzero. 


Question. Does it suffice to know that L, f = A,f for all g in a Dirac family at 
the identity? 


(b) Let A be the operator that averages over the compact group K = SO(2). Then 


Aol) = | o (kak, 


where dk is left (which equals right) Haar measure on K = SO(2), normalized 


so that 
| dk=1. 
K 


We have proved the uniqueness of the spherical function h;(z) = P,_1(coshr), 
for z = ke "i, with k € K, r > 0. See formula (3.25) of Sect.3.2 with a = 0 
and Exercise 3.2.9 of that section. Thus A@ = ch, with c = (i), for i = V/—I. 
It follows that 


AC *8)(a) = ((AG) * 8) (a) = ((chs) #8) (a) = c(hs * 8) (a). 


Evaluate this at a = the identity, 7, to obtain: 


Leb) = (0 *8)(T) =c(hs*g)(1) where c= 91) = ọ (i), 
by our identification. Therefore we can set f(z) = y’ and obtain 


as we saw in the note after the definition of the convolution operator in for- 
mula (3.134). 

Note that if Af = uf, then Lyf = A,f. For if a € G, set f*(x) = f(ax), when 
x € G. Then A(f“) = (Af)* = u f°. So, using the preceding results, we find that: 


() =) = | eddy = g(1-9), 


LPO) = Lf) = 80 - SFO), 


which implies that (Lg f)(a) = (1 — 5) f(a). 


We are now in a position to prove the remaining part of Theorem 3.7.1. 
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Theorem 3.7.2. Suppose g : H — C is infinitely differentiable with compact 
support. Let Lg be the convolution operator in Eq. (3.134). Then Lg is a compact 
operator on the space L}(T\H), T = SL(2,Z), defined in Lemma 3.7.1. Thus 
the spectrum of A on LŻ(T\H ) is discrete and the spectral theorem for compact, 
self-adjoint operators (see Theorem 1.3.7 of Sect. 1.3) assures the existence of a 
complete orthonormal set of cusp forms spanning L}(T\H). 


Proof. Let g : K\G/K — C be C” with compact support for G = SL(2,R), K = 
SO(2). We want to show that Lg is a compact operator on L4(T\H). Suppose that 
|||| denotes the sup norm for functions on G; i.e., 


Il fl] = sup{|f()|, x € G}. 


And let ||||2 denote the Z? norm for functions on '\H; i.e., 
Ifl = Ga with the inner product in Eq. (3.131). 


We want to show that there is a positive constant C such that for any f in 
LP \H), 


[Le fll < Cllflo- (3.135) 


This will imply that Lg is a compact operator on L}(I'\H). To see this, note that 
the image of ie under Lg is an equicontinuous family of continuous functions. If 
we can show the family to be uniformly bounded, then the theorem of Arzela- 
Ascoli (see Kolmogorov and Fomin [366]) implies that sequences {L,f,,} must 
have subsequences converging on compacta in I’\H to continuous functions. Lang 
[389] gives another proof of this. 

Now, in order to bound the L, fs by the Lz norms of the f’s, we argue as follows, 
using the Poisson summation formula from Sect. 1.3. Assuming that g(a) = g(a~'), 
we can write 


Lafla) = | Fela 2y dx dy, 


Suppose next that 


ceo lv 
=j g 
a -e(o e) (1) withk € K,d>0, veER. 


It is possible to make this decomposition by Exercise 3.1.9 of Sect. 3.1. In the gen- 
eral case considered in Volume II [667], this is called the Iwasawa decomposition 
of ain G. 

We are trying to show that L,f(a) is bounded as ai = —v + i/d approaches 
infinity in H; i.e., as v approaches + or —ce, or d approaches 0 or infinity. We 
will assume that g(x + iy) = h(x)k(y) to simplify the calculations, since g can be 
approximated by such functions. Now set 
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r={(o7)|ne2} 


Then 
Lfl) = | Yglde+n+v))f@y? dxdy 
r.\n"<4 
= J > h(d(x+n+v))k(dy)f(z)y~* dx dy. 
r.\n" = 


Poisson tells us (using the dilation and translation properties of the Fourier 
transform on R) that 


$ A(d(x+n+yv)) = a ` Î(n/d)exp|2rin(x+v)], 


neZ neZ 


where f denotes the Fourier transform of h over R. The fact that h is C” with 
compact support implies that |A(x)| < c|x|~? for any power p. So if n 4 0, we have 
a bound. If n = 0, since f is in Li, we find that 


h(0)k(dy) f(z)y? dx dy =0. 
T..\H 
Thus we have 
\Lef(a)| < cd?! I Y |nl-Plk(dy)||F(z)|y? dx dy. 
Ta \g "70 


We have assumed that the function k has compact support. Thus for d small 
enough, the preceding integral will be bounded by a constant times 


a= f IFO? dx dy. 
|x[<1/2 
y>2 


The Cauchy—Schwarz inequality completes the proof of the desired inequal- 
ity (3.135). Finally, the proofs of Theorems 3.7.1 and 3.7.2 are complete. 


Question. Suppose we replace SL(2,Z) in Theorem 3.7.1 by some congruence 
subgroup T (N). Can you formulate harmonic analysis on T(N)\H as some sort of 
product of that on SL(2,Z)\H with that on the finite group SL(2,Z)/T(N)? 
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3.7.3 A Non-Euclidean Poisson Summation Formula 


It is now possible to develop a non-Euclidean analogue of the Poisson summation 
formula, using the same argument that proved Theorem 1.3.2 of Sect. 1.3. 


Theorem 3.7.3 (A Non-Euclidean Poisson Summation Formula). Let f : H — C 
be K = SO(2)-invariant; i.e., f (kz) = f(z) for all k € K, z € H and suppose also 
that for T = SL(2,Z), the series 


szv= > fm) 


yer {1} 


converges absolutely and uniformly on compacta in H to a function g(z) such that 
Ag € L? (T\H). Then, using the notation (3.129) and (3.130), 


e0 = E fmz f AED ELas, 


4ri 
with f(s) =the non-Euclidean Fourier (or Helgason transform) given by 
f(s) = | FOF y? dxa. 


Proof. By Theorem 3.7.1, we can write 


e0 = Elem tg f 8E) Ods 


n>0 Ani es=5 


Convergence is uniform and absolute by Exercise 3.7.17 that follows. Using the fact 
that H is a disjoint union of translates by y € T/ {I} of the fundamental domain 
T\H, we see that 


F(yz)vn(z) y? dx dy = f, Ff (z)vn(z) y” dx dy. 


(ev) = f 
T\H Pik 


i AR 


Then Lemma 3.7.3 implies that (g, vn) = f (Sn)Vn(i). The formula with v, replaced 
by E; is proved similarly, to complete the proof of Theorem 3.7.3. 


Later, while discussing Selberg’s trace formula, we shall find another derivation 
of Theorem 3.7.3 (from the non-Euclidean analogue of Mercer’s theorem which was 
Theorem 1.3.8 of Sect. 1.3). 

The analogue of Theorem 3.7.3, for I such that T\H is compact, goes back 
to Delsarte in 1942. Delsarte used this result to study non-Euclidean lattice 
point problems such as the asymptotic result given in formula (3.136) below. 
Others have used Theorem 3.7.3 to study Green’s functions for the Laplacian on 
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T\Æ (see Exercise 3.7.14 below). It is also possible to use the non-Euclidean 
Poisson summation formula to study the Poincaré series that arise in work on the 
Ramanujan—Petersson conjecture for Fourier coefficients of modular forms (see 
Bruggeman [68], Deshouillers and Iwaniec [130], Goldfeld and Sarnak [221], 
Kuznetsov [381], and Selberg [570]). Kudla and Millson [379] use non-Euclidean 
Poisson summation to give an explicit construction of the harmonic one-form dual 
to an oriented closed geodesic on an oriented Riemann surface of genus greater 
than 1. Takhtadzhyan and Vinogradov [654] use such results to obtain relations 
between fundamental units of real quadratic fields and the existence of cusp forms 
with eigenvalue 4 for To(d). 


Example 3.7.1 (Non-Euclidean Lattice Point Problems). References include Buser 
[74], Delsarte [129, Tome II, pp. 599-601, 829-845], Huber [306], Mennicke [463], 
Nicholls [495], and Patterson [510]. Recall first that in Theorem 1.3.5 of Sect. 1.3 
we found an asymptotic result for the number of lattice points; i.e., elements of Z7, 
in a circle, as the radius of the circle blows up. One non-Euclidean analogue of 
this is 


Nr(x) = #{y eT / {£0} | coshd(i, yi) = (3.136) 


3 2x 
area(I \H) ? as X —> 09, 


for I = SL(2,Z), where d(z,w) denotes the non-Euclidean distance between points 
z,w € H. Note that 27 /area(I'\\H) = 6 in this case. 


You might be worrying about the hyperbolic cosine in the definition of Nr(x) 
in formula (3.136). Hopefully the following exercise will allay these worries by 
indicating that there is a generalization to SL(n,Z) or GL(n,Z) (which will be 
considered in Volume IT or [667]). 


Exercise 3.7.8. 


(a) Recall Exercises 3.1.9 and 3.1.10 of Sect. 3.1. These exercises show that if z = 
ke~'i, for k € K = SO(2), r > 0, and W, is the corresponding 2 x 2 matrix in 


SP2, then 
10 etl? 0 
— k 
"i Ge ( 0 vr) |; 


Here we use the notation Y [A] = ‘AYA, with ‘A = transpose of A. Show that 
f 1 _~ 1 
coshd(i,z) = ale" te A= z TEW). 


Then show that if 
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2coshd(yi,i) = Tr (Yy) =@ +b +° +a’. 


my) 
(b) Prove that 2coshd(z,w) +2 = ~= 


ImzImw* 


Hint. See Exercise 3.2.25 of Sect. 3.2. 


From Exercise 3.7.8, we see that formula (3.136) is equivalent to the following 
asymptotic formula: 


sye T/{+1} | st) <x} ~ 6x, aS x ©, 
which means that 
#{a,b,c,d€Z | +b +P +d <x, ad—be=1} ~ 6x, as x +0. (3.137) 
We know from Theorem 1.3.5 of Sect. 1.3 that 
#{a,b,c,d EZ |a +b +P td <x}~mx, as xe. (3.138) 
A comparison of these two asymptotic results gives one a good feeling for the 
relative densities of these two sets. 


In order to prove Eq. (3.136), one may imitate the proof of Theorem 1.3.5 of 
Sect. 1.3. To this end, we form the non-Euclidean theta function !? for a > 0: 


(a= X, exp | —Sat("7)| = $, exp[—acoshd(i,yi)]. (3.139) 


ycT/{1} ycT/{1} 


Set 
fale) =expl-avoshd(i.)] = exp -Za TW] forze H, 


where W, is the corresponding matrix in SP3 from Exercise 3.1.9 given for z = 


x + iy by 
_(l/y 0 1 x| ~ip 7 
a (o o) E 1 = + Sy (3.140) 


The Helgason transform is 


fils) = / exp -5a Tr (W| yar dy = 4] EK, (a). (3.141) 


2 
W:ESP2 


12Our non-Euclidean theta function in Eq. (3.139) is not the same as that considered by differential 
geometers (see Molchanov [473, p. 46] and the proof of Theorem 3.7.5). 
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To prove Eq. (3.141), use formula (3.140) to see that Tr(W,) = (1+ (x? +y7)) /y, 


and 
tive f (Ees 


y>0 xeR 


Perform the integral over x and obtain 


—ax? 2my 
I exp (=) x= “a 


xeR 


to- yE [reel S(t] & 


A formula from Exercise 3.2.1 of Sect. 3.2 parses the proof of Eq. (3.141). 
Applying Theorem 3.7.3 to f(z) = exp{—2 z4 Tr(W-)], we have 


It follows that 


@r(a)= E exp|-STH("yy)| 


yel/{+]} 


27 
Vali Pe a K__1(a) \Es(i 2ds. 
PRAO] VF ta he, O BO 


(3.142) 


In order to obtain the asymptotics of @r(a), as a + 0+, we need the following 
results: 


om 
Kin (a) = ew? (cos [wlogS —wlogu+u+ “| +0(1)) , as uo, a —> 0; 


(3.143) 


1 
Ey ul) =0 (lut) as ure. (3.144) 
Formula (3.144) was Exercise 3.7.5 and formula (3.143) comes from Exercise 3.7.9 
below. 

One finds that the main term on the right-hand side of Eq. (3.142) is the term 
corresponding to vo, and the rest is O(1); i.e., bounded. It follows that 


20 
~ — 0+. 3.145 
ora area([\H)a er pei ( ) 
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One finishes the proof of Eq. (3.136) with the same Tauberian theorem that sufficed 
to prove Theorem 1.3.5 of Sect. 1.3 (namely, Theorem 1.2.5 of Sect. 1.2). 


Exercise 3.7.9. Prove formula (3.143) using the following facts about J and K 
Bessel functions (see Lebedev [401, p. 140]): 


 I-s(z) —Is(z) 


Ks(z) = 2 sin(s7) 


(taking limits if s € Z); 
ae (22/4) 

no = (5) > ar k+1)" 

oki (stkt+1) 


Example 3.7.2 (Green’s Functions for A— à on T\H by the Method of Images). 
References for this subject are Elstrodt [155,156], Faddeev [171], Fay [176], Hejhal 
(261,264, Vol. 2], Lang [389, Chap. 14], and Neuenhoffer [494]. We discussed the 
method of images briefly in Sect. 1.3. Here we seek a non-Euclidean analogue of 
the Green’s function in Exercise 1.3.23 of Sect. 1.3. 

The Green’s function G,(z,w) for the resolvent R} = (A—AJ)~! on the 
fundamental domain I'\H, T = SL(2, Z), satisfies 


Ry f(z) = f Gy (z,w)f(w) v? du dy. (3.146) 


w=u+ivel\H 


The method of images says that if g} (z,w) is the kernel for the resolvent R} on all 
of H, then 


Gu(zw)= > gaz w). (3.147) 
ycT/{1} 


We will show that g} (z,w) = + 0,_1(coshd(z,w)), if A = s(s — 1) and Q,—1 
denotes the Legendre function of the second kind. See formula (3.149). 


So let us find gy (z,w). In order to do this, one should recall the basic facts about 
Green’s functions (see Courant and Hilbert [111, pp. 363-388], Garabedian [197] or 
Stakgold [618,619]). In particular, g} (z, w) should have the same sort of singularity 
as that of the Euclidean Green’s function for the Euclidean Laplacian on R*. The 
reason for this is that the y* in the non-Euclidean Laplacian is cancelled out by the 
y7? in the G-invariant area element on H. This always happens when calculating 
Green’s functions for differential operators with weights. 

So the singularity of g,(z,w) is the same as that of (1/27) log|z—w]. Also, 
gı (z,w) must be G = SL(2, R)-invariant; i.e., g} (az,aw) = gy (z,w) for all z,w in H 
and a in G. Moreover g} (z,w) should be as small as possible as z — œ. Of course, 
g, should also satisfy the differential equation 


02 2 
y? (Ht) 8a (z,w) = Àg, (z,w) for Fw. 
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Exercise 3.7.10. 


(a) Show that the Green’s function for L = 9? /9x? + d7/dy* in RÈ is k(z,w) = 
(1/27) log|z—w; i.e., 


eg] k(z,w) f(w) du dv. 
w=u+iveR2 


Hint. Use Green’s theorem or Fourier transforms (see Courant and Hilbert 
[111], Garabedian [197], or Vladimirov [706]). Compare with Exercise 1.1.6 of 
Sect. 1.1 and Example 1.2.2 of Sect. 1.2. 


(b) Explain why the non-Euclidean Green’s function g} (z,w) for (A—A) on H 
should have the same singularity as k(z,w) as z approaches w. 


Hint. See Lang [389, pp. 276-280]. 


Because g,(z,w) is G-invariant, we can move w to i. Also, gy (z,i) is a function 
of d(z,i) =r = the geodesic radial coordinate of z = ke~"i, for k € K = SO(2) and 
r > 0. Then, by separation of variables applied to 


a a 
y (3 + Z) galz w) =Aga (zw) for z=x+iy#w=u+iv, 

we find as in Exercise 3.2.11 of Sect. 3.2, that solutions which are functions of the 

radial variable alone have to be of the form 


f(r) =AP_;(coshr) + BQ_;(coshr), A = s(s— 1). 


Here P_,(u) and Q_,(u) are the Legendre functions of the first and second kinds, 
respectively (see Lebedev [401, Chap. 7]. In order to make f(r) behave as r > ©, 
we choose A = 0. In order to obtain a singularity (1/27) log|z—i| as z — i or r > 0, 
we need the following exercise. 


Exercise 3.7.11. Show that if z = ke~"i, with k € K = SO(2) and r > 0, then 
log|z—i| ~log(coshr—1) as r>0+. 
Hint. From Exercise 3.7.8, we have 


o |z+il? 


2coshd(z,i) +2 
coshd(z,i) + Imz 


Thus, 


=i? 


2coshd(z,i)— 2 ‘ 
coshd(z, i) Imz 
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Next note (see Erdélyi et al. [164, Vol. I, p. 163]) that if s A —1, —2, ..., 
1 
Qs—1(coshr) ~ — 5 log(coshr — 1) asr> 0+. (3.148) 


Thus, in order to obtain the correct singularity, the (free-space) Green’s function 
for A— A on H must be 


ga(z,w) = = o, 1(coshd(z,w)), A =s(s— 1). (3.149) 


We can check this against Elstrodt [156, p. 67], where g, is given as follows, once 
we note that Elstrodt sums over I and not T/ {+7} and replaces A by —A: 


E -T(s)? /u+1\ _ 2 
ga (z,W) = 2nT (2s) ( 7 ) F (s.s2s =) 4 (3.150) 


if u = coshd(z,w), A = s(s — 1), and F(a,b,c;z) is the Gauss hypergeometric 
function (see Lebedev [401, Chap. 7]). The reason is that, from Lebedev [401, p. 3 
and p. 200], we have 


=] _ —1 T(s) i -=s S.S oo 
z 2-14) = Jao Tis+ 3)" 1) F( BA a) 


—1T(s)? a 
= Oi Gaal. F ae : 
2a T(2s) 2 u+1 


Next we use formula (3.147) to build up the Green’s function for A — AJ on the 
fundamental domain T \ H. 


Exercise 3.7.12. Show that the series (3.147) converges for (z,w,s) in compact sets 
inside the domain H x H x {s|Res > 1}. 


Hint. You need to know (see Lebedev [401, p. 176]) that 


Jnl (s+1) 


Q;(z) ~ Tis+ Qo 


as |z| => œ, s#—1, —2,.... 


This allows you to bound the series (3.147) by a constant (dependent on s) times 
the non-Euclidean Eisenstein series or Epstein zeta function 


Zr(z,w;s)= > (coshd(z,yw))~. (3.151) 
yel/{+1} 


Note that Eq.(3.151) is the Mellin transform of the theta function defined in 
Eq. (3.139); i.e., 
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T(s)Zr(i is) = | a’! @p(a)da. (3.152) 
a=0 
You might find it hard to believe that the series (3.151) converges for Res > 1, 
since Epstein’s zeta function in Sect. 1.4 given by 


Zias)= Y (+b 4+?4+d*)5 
(a,b,c,d) £0 


only converges for Res > 2 (see Exercise 1.4.5 and Theorem 1.4.1 of Sect. 1.4). 
However, there are many fewer terms in the sum for Zr(z,w;s), as we saw in 
Eqs. (3.137) and (3.138). In fact, one could use Eq. (3.137) to do Exercise 3.7.11 
and the fact that Zr(i,i;s) has a pole at s = 1 to prove Eq. (3.137) via a Tauberian 
theorem for Dirichlet series. 

One can study the convergence of Eq. (3.151) using an integral test. For one has 


_s du dv 
y2 


£ (coshd(i, yw)) 


cl yer {1} 
w=utiveT\H coshd(i,yw)>1 


> 2n | (coshr) *sinhr dr, 
coshr>1 


which is finite if Res > 1. 


Exercise 3.7.13. Apply the non-Euclidean Poisson summation formula of Theo- 
rem 3.7.3 to Zr(i,i;s) in formula (3.151), or Mellin transform formula (3.142). 


It follows from Exercise 3.7.12 that the analytic continuation of the Green’s 
function G} (z,w) in A = s(s — 1) to Re s = 4 is necessary in order to reach the 
spectrum of the non-Euclidean Laplacian. There are many ways to approach the 
analytic continuation of G}. We shall not go into this here, beyond giving a few 
more exercises. One wants this analytic continuation because it provides another 
proof of Theorem 3.7.1 (see formulas (2.24)-(2.26), since one can deduce the 
spectral theory of A on \H from the behavior of G}. The poles of G} correspond 
to the discrete spectrum of A, and the jump discontinuities as A crosses the real axis 
correspond to the continuous spectrum. Recall here that real A = s(s— 1) correspond 
to s with Re s = 5 or s € [0,1]. 


Exercise 3.7.14. 


(a) Show that G} (z,w) given by formula (3.147) is an L?-function and thus has an 
L?-expansion 


= a Vn(Z)Vn(w) 1 E;(z)Es(w) 
G, (z,w) => A — An t Jri ka I ds. 
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Note that this gives a non-Euclidean Poisson sum formula for G} (z,w). 
(b) For absolute convergence, show that one should look at G} — Gy, since 
(A — u)R}Ru = Ry — Ry, if R} = (A — Al)! =the resolvent. 


Exercise 3.7.15. Since G% is T-invariant in z = x + iy, it has a Fourier expansion as 
a periodic function of x = Rez. Show that this expansion is 

ys 
2s— 1 


—G,(z,w) = E;(w) +> F,(w38)V/YK,_ 1 (22|n|y) exp(27inx), 


n#0 


with 


F,(w;s) = 5 Im(yw) L-i (272|n| Im(yw)) exp (2min Re(yw)). 
yeTo\T = 


Here /,(z) denotes the J-Bessel function (see Lebedev [401, Sect. 5.7]). One must 
keep Imz large compared with Im(yw), y €T, in this expansion. 


Exercise 3.7.15 helps to explain Hejhal’s work [263], discussed briefly in 
Sect. 3.5, showing that spurious eigenvalues A = s(s— 1) of Aon T\H correspond 
to zeros of Epstein’s zeta function: 


Z(W,s) = 6 (2s)Es(p) = 6(s)L(s,(—3/*)), 


with 


p =exp(2ni/3) =(—1+iV3)/2, W= = € =) 


For at zeros s of E,(p), the Green’s function G,(s—1)(z, p ) looks like a cusp form, by 
Exercise 3.7.15, except that it has a logarithmic singularity at z = p. 

Hejhal [264] uses Fourier-type expansions of G,(z,w) coming from certain 
hyperbolic matrices to show that the solutions of the congruence 


y? =5 (modL) 


are such that y/L(mod 1) is uniformly distributed as L approaches infinity, and that 
the finer properties of this distribution are ruled by the eigenvalues of A of TH. 
The result on distribution of solutions of quadratic congruences was first proved 
by Hooley [301]. The generalization to higher-degree congruences remains open. 
However, it is possible that extensions of Hejhal’s Green’s function identities to 
SL(n,Z) might allow one to attack such questions, especially if one recalls the 
work of Dorothy Wallace [711] connecting units in higher-degree number fields and 
hyperbolic elements of SL(n,Z). See Hejhal [267-271], and Fay [176] for related 
work. 
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Exercise 3.7.16 (The Heat Equation on '\H). Given some initial heat distribu- 
tion function f on TĦ, find u = u(z,t) satisfying 

u = Azu, A; = y"(0"/dx° +0°/dy"), 

z=x+ iy, and u(z,0) = f(z), for z € TĦ, t > 0. 


Answer. Using the notation (3.129) and (3.130), 


t) = X Anexp(Ant)vn(z) + = Ia _Asexp{s(s— 1)t}Es(z)ds 


n>0 


where A, = (f,Vn), and As = (f,Es). 
One way to see this is to use the method of images to show that the fundamental 
solution for the heat equation on T\H is 


e*(zt)= LY aly), 


yer {1} 


&: = the fundamental solution for the heat equation on H itself given by formu- 
las (3.37) and (3.38) of Sect. 3.2. Apply the non-Euclidean Poisson sum formula to 
see that 


1 ; — 
—_ 1 oY DE, (2)ds 
Res=4 


4ri 


where Àn = Sn(Sn— 1). 


Exercise 3.7.17 (Remarks on Convergence of Non-Euclidean Fourier Series). 
Suppose that Af € L?(T\H). Show that the non-Euclidean Fourier “series” for f 
(see Theorem 3.7.1) converges uniformly and absolutely. 


Answer. Imitate the proof of part (3) of Theorem 1.3.1 of Sect. 1.3. Look at 


Frm + Fag fi IEE ds 
(Af, Vn )Vn(z) 


n>0 
= f 
n>0 Sn(Sn— 1) Ami JRes=} s(s— 1) 


1 
2 
Wh) 2 
(zl a LT) (Zere) 
n>0|Pn\en n>0 
Es 


r (Hha fe Fas) = KArE)Pas) 


Then use the fact that Af and G, = the Green’s function for A— À are both square- 
integrable. 


AF JE) | 4, 


II 


IA 


Nie 
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There is another application of Theorem 3.7.1, which was given by Kaori 
Imai Ota [315] showing that there is a higher-dimensional analogue of Hecke’s 
Theorem 3.6.1 of Sect. 3.6. We shall discuss this in more detail in Volume IT ((667]). 
Briefly, one wants higher-dimensional Mellin transforms of Siegel modular forms 
for the symplectic group Sp(n,Z). When n = 2, Imai Ota shows that the converse 
theorem in Hecke theory can be proved using the Roelcke—Selberg spectral resolu- 
tion of the Laplacian on L?(SL(2,Z)\H). When n is larger than 2, this converse 
theorem requires a workable generalization of Theorem 3.7.1 for SL(n,Z) — a 
subject to be discussed in Volume II [667]. 

Elstrodt et al. [157—160] and Mennicke [463,464] have used the non-Euclidean 
Poisson sum formula for groups such as SL(2,Z/i]) (to be discussed in Volume 
II [667]) in order to derive many sorts of algebraic and analytic results. One very 
interesting idea of Mennicke [464] is to use Theorem 3.7.3 in order to study the 
eigenvalues of the Laplacian corresponding to Maass-type cusp forms, as well as to 
study the cusp forms themselves. 


3.7.4 Selberg’s Trace Formula for SL(2, Z) 


The discussion of Selberg’s trace formula should begin with a review of the part of 
Sect. 1.3 which gave an interpretation of Poisson’s sum formula as a trace formula. 
However, the Selberg analogue of Poisson summation is complicated by the lack of 
commutativity of the groups G and T involved, as well as the lack of compactness 
of the fundamental domain. This section was greatly influenced by the work of 
Dorothy Wallace [711,712]. 

We wish to find traces of the“compact part” of the convolution operators in 
Eq. (3.134) and Lemma 3.7.2, defined by 


Lef(a) = ig ee aa (3.153) 


viewed as integral operators on functions f in L?(I'\H), for K-invariant functions 


gE€C7 (H). We can rewrite Eq. (3.153) as an integral operator on the fundamental 
domain as follows: 


Lef(a) = f. opf OKelasb)ab, 


K,(a,b) = ¥ g(b"'ya), T =T/{1} = PSL(2,Z). (3.154) 
yer 


Note that we must use T’ = T’/{+/} = the projective linear group, or the 
formula will be off by a factor of 2. For T\H =I \H, since the fractional linear 
transformation corresponding to —y is the same as that for +y, and 
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H = U y(D), D = a fundamental domain for TĦ, 
yer 
is a disjoint union. 
We will write 

Kg (a,b) = Kọ (a(i), (i), 

kg (a,b) = ke(a(i),b(i)) = g(b™'a), (3.155) 
for a,b € G = SL(2,R). Here, as usual, a(i) is the point in H to which the fractional 
linear transformation corresponding to a € G sends i= /—1. 


In this setting, using the notation in Eqs. (3.129) and (3.130), involving vy, Sn, 
and E,, Mercer’s theorem (see Theorem 1.3.8 of Sect. 1.3) becomes 


RK, (a,b) = Dns08(Sn)vn(a(i))vn(bO) (3.156) 
E,(a,b) = a Jres-} 8(S)Es(a(i) )Es(b(i)) ds 


Here ê denotes the Helgason transform of Sect.3.2 (which is also called the 
Selberg transform in this context): 


8s) = g(z)y" ? dx dy, (3.157) 


for K-invariant functions g. Formula (3.156) is equivalent to the non-Euclidean 
Poisson summation formula in Theorem 3.7.3 above. 
Next define the integral operator Ê, corresponding to Ky by 


Ê fla) = i f(b)Kg(a,b)db. (3.158) 
T\G/K 
Our discussion of the Selberg trace formula begins by writing 


Trace Ê, = a(n) = | K,(a,a) da. 3.159 
= >, (sn) a (3.159) 


n>0 
In order to proceed further, we must decompose SL(2, Z) into conjugacy classes of 
various sorts, according to the Jordan forms (over C) of these matrices. 
Definition 3.7.2. Classification of Elements of SL(2,Z) =T (cf. Sect. 3.3). 
(1) Central Elements: +/,—/. 
(2) Parabolic Elements: those having Jordan form + (G J , with a £0. 
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(3) Elliptic Elements: those with Jordan form & i ), with a ¢ R, |a| = 1. 
a 


(4) Hyperbolic Elements: those with Jordan form ( j i, with a € Ra F 


a 
0 1/a 
1,—1,0. For a hyperbolic element y, we define a? = Ny, the norm of the 
element, where a is chosen so that |a| > 1. 


Definition 3.7.3. For œ €T, the conjugacy class of œ in T is 

{a}={yay'| yer}. (3.160) 
Definition 3.7.4. For œ € T, the centralizer of œ in T is 

Te ={yET | ya=ay}. (3.161) 


Exercise 3.7.18 (Explicit Characterizations of the Conjugacy Classes in SL(2, 
Z)). 


(a) Show that the conjugacy classes of parabolic elements of SL(2,Z) are repre- 


sented by 
la 
E Z. 
(4 ‘) , ae 


(b) Show that the conjugacy classes of elliptic elements of SL(2,Z) are repre- 


sented by 
1 O1\ ,/f1-1 
~\-10/'"A\1 0) 


(c) Show that the conjugacy classes of hyperbolic elements of SL(2,Z) are repre- 
sented by units £4 in orders 9 in real quadratic fields Q(vd ), with multiplicity 
hq = the narrow class number of the order ©. The narrow class number is 
defined similarly to the class number in Sect. 1.4, except that the equivalence 
relation between two ideals a, b in Z[e,] is 


a~b&a=cb force Q(Vvd) withnorm Ne = +1. 


The order 9 need not be the whole ring of integers in Q(V/d). 


Hints. See Olga Taussky [659, 660], Shimura [589], Schoeneberg [562], and 
Dorothy Wallace [711,712]. It helps to look at the points in H UR which are fixed 
by y €T. Part (c) is the most complicated. Following Taussky [loc. cit.] and Wallace 
[loc. cit.], you should note that if y is a hyperbolic element of SL(2, Z), the diagonal 
elements £ in the Jordan form of y must be units in a real quadratic field K, since € is 
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a root of the characteristic polynomial of y. The eigenvectors w1, w2 in K? generate 
an ideal Zw, © Zw in an order © of K. Note that this order need not be the maximal 
order; i.e., the whole ring of integers of K defined in Sect. 1.4. However, one can still 
form the narrow ideal classes (which do not necessarily form a group). And one can 
relate the number of these narrow ideal classes to the ordinary class number of K 
(see Lang [390]). Sarnak [555] gives a description of part (c) of this exercise in the 
language of quadratic forms which goes back to Gauss. 


The following exercise shows that the concept of primitive hyperbolic element 
of SL(2,Z) is not the same thing as that of fundamental unit in the relevant real 
quadratic number field. 


Exercise 3.7.19. 


(a) Let y be a hyperbolic element of I = SL(2,Z). Show that the centralizer Ty of 
y in T is an infinite cyclic group generated by % which is called a primitive 
hyperbolic element of T. 


(b) Show that 6 a is a primitive hyperbolic matrix in I with eigenvalue 


z = (743V5)/2, where z = (1 + V5)/2. 


Hint (see Buser [74, p. 228] or Hejhal [261, Chap. 1]). Use the fact that T 
is discrete as well as the possibility of simultaneously diagonalizing commuting 
elements ofT. 


Exercise 3.7.20 (Hyperbolic Conjugacy Classes and the Length Spectrum of 
Closed Geodesics in the Fundamental Domain). 


(a) Let C(z,w) be a geodesic line or circle in H connecting two points z,w on 
RU {co}. Consider the image C(z,w) in the standard fundamental domain for 
SL(2,Z)\H which is given in Fig. 3.14. We say that C(z, w) is a closed geodesic 
if it is a closed curve in the fundamental domain. Here we mean “closed” in 
the sense that, once correctly parameterized, the beginning of the curve is the 
same point as the end of the curve. We do not refer to the topological notion 
of closed (i.e., the set of points on the curve has open complement). Show that 
C(z,w) is a closed geodesic in SL(2,Z)\H if and only if there is an element y of 
SL(2,Z) such that yC(z,w) C C(z,w) (as sets of points). Show then that C(z, w) 
is a closed geodesic if and only if z and w are the fixed points of a hyperbolic 
element y of SL(2, Z). 

(b) Suppose that C(z,w) is a closed geodesic in SL(2,Z)\H, with z,w the fixed 
points of a hyperbolic element y of SL(2,Z). Show that if a point q lies on 
C(z,w), then so does yq and the hyperbolic distance between q and yq is log Ny, 
where Ny is the norm of y, from Definition 3.7.2. 

(c) Using a computer, graph C(z,w) for various choices of z,w. We did this in 
Figs. 3.42-3.45 using the image of the standard fundamental domain under 
z>-1/z. 
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0.5 0.5 


Fig. 3.42 Images of points on the geodesic circle of center 0 and radius 5 after mapping by 
SL(2,Z) into the image of the standard fundamental domain under z + —1/z 


It follows from Exercises 3.7.18—3.7.20 that the primitive hyperbolic conjugacy 
classes in SL(2,Z) have both a number-theoretic and a geometric interpretation. 
The number-theoretic interpretation is that they correspond to fundamental units 
in orders in real quadratic number fields. The geometric interpretation is that they 
correspond to closed or periodic geodesics on the Riemann surface SL(2,Z)\H— 
the length spectrum of SL(2,Z)\H. The number of hyperbolic conjugacy classes 
with a given trace can thus be viewed either as a class number or as the number 
of closed geodesics with a given length. Selberg’s trace formula (Theorem 3.7.4) 
gives relations between the length spectrum and the eigenvalue spectrum of A 
on SL(2,Z)\H. This results in an analogue of the prime number theorem (see 
Theorem 3.7.6), which can thus be given either a number-theoretic or a geometric 
interpretation. We can also view the hyperbolic conjugacy classes as corresponding 
to non-Euclidean billiards on SL(2,Z)\H such as that pictured in Fig. 3.43. 

In Sect. 1.5.4, we considered Weyl’s criterion for the equidistribution of numbers 
on a circle R/Z or the torus R*/Z?, showing that a line in R? which makes an 
irrational angle with the x-axis will correspond to a densely wound line in the torus 
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-0.5 0.5 


Fig. 3.43 Images of points on the geodesic circle of center 0 and radius (15/2)'/? after mapping 
by SL(2, Z) into the image of the standard fundamental domain under z > — 1 /z 


R? / Z?. In Exercise 3.6.17 of Sect.3.6, we saw that the image of a horocycle at 
height y in SL(2,Z)\H tends to fill up the fundamental domain as y + 0+. 

One can ask analogous questions about geodesics. Before asking these ques- 
tions, the reader should consider Figs. 3.42-3.45, which give plots of points on a 
fundamental domain in various geodesic circles C(—x, +x). Figure 3.42 shows the 
geodesic C(—5,+5). This geodesic has only a finite number of segments and is 
clearly not dense, but it is also not closed. Figure 3.43 shows a genuine closed 
geodesic C(—x,+x) for x = \/15/2, which is fixed (as is —x) by the hyperbolic 


matrix 
11 30 
4 11) 


Figures 3.44 and 3.45 show points on approximations to C(—x, +x) for x = v 163 
and e = 3.1415.... In the first edition of this book the analogous plots were made 
with a TRS-80, Model 100 computer. Here I used my 2006 HP PC and Mathematica. 
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Fig. 3.44 Images of points on the geodesic circle of center 0 and radius (163)!/ 2 after mapping 
by SL(2, Z) into the image of the standard fundamental domain under z + — 1 /z 


Of course, the problem of drawing these pictures for geodesics with irrational radius 
is similar to the problem one has when drawing fractals: Computers have a fixed 
precision. And so does the human eye. 


Questions. 


(1) As the length of a closed geodesic approaches infinity, does the geodesic tend 
to fill up a fundamental domain, in an analogous way to that for the horocycle 
of Exercise 3.6.17 of Sect. 3.6? 

(2) Can one give a criterion on z,w which will insure that a geodesic C(z,w) is 
dense in a fundamental domain for SL(2,Z)? 


These questions are related to ergodic theory (as well as continued fraction 
expansions of real numbers). Let T : M —> M be a measure-preserving transfor- 
mation of a Riemannian manifold. One says that T is ergodic if for every Lebesgue 
integrable function f on M, one has 
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-0.5 0.5 


Fig. 3.45 Images of points on the geodesic circle of center O and radius e after mapping by 
SL(2, Z) into the image of the standard fundamental domain under z + — 1 /z 


n-1 
lim : 5 f(T*x) = f f(y)du(y) for almost all x € M. 
P K=0 K 


n—oo 


The average on the left may be considered a time average, while that on the 
right may be considered a space average. Actually, M is often replaced by its 
unit tangent bundle in most works on the subject. References for ergodic theory, 
geodesic flows, etc., are Artin [14], Auslander et al. [18], Bedford et al. [32], 
Bowen [60], Gelfand and Fomin [202], Mautner [460], Moeckel [472], and Nicholls 
[495]. Svetlana Katok and Ilie Ugarcovici [343] survey some of the results on 
coding closed geodesics in SL(2,Z)\H. In particular, they give various methods 
to code the geodesics with bi-infinite sequences of integers. The geometric method 
records the sides of a given fundamental domain cut by the geodesic. The arithmetic 
codes use various types of continued fraction expansions of the endpoints of the 
geodesics or their reciprocals. If you write programs to draw your own versions 
of Figs. 3.42-3.45, you will appreciate the appearance of continued fractions here. 
Other references for the distribution of geodesics in the fundamental domain of the 
modular group are: Duke [144] and Einsiedler et al. [154]. 
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To return to our discussion of the trace formula, first suppose that I has 
no parabolic elements. Then we can write the trace as follows, using formu- 
las (3.154), (3.155), (3.159)-(3.161): 


Y 8(sn) 


n>0 T\H 


K,(b,b)db 


i gla! o~l yoa)da (3.162) 
M _oery\r’l\ 


distinct in T 


= } f g(x! yx)dx 
{7} Le 


distinct in T 


Exercise 3.7.21. Use the relation between K, and kę given in formulas (3.154) 
and (3.155) as well definitions (3.160) and (3.161) to prove formula (3.162) for 
groups I’ with compact fundamental domain. 


However, in the case T = SL(2,Z), there are parabolic elements and thus we 
must cancel their contribution to the trace formula against that of the continuous 
spectrum. This allows us to write the trace formula in the following form: 


D &(sn) = cgl) + x cg(Y), (3.163) 
n20 (7 
distinct nonparabolic 


conjugacy classes in T 


where the orbital integrals c,(y) are defined by 


cg(y) = f g(a`™'ya)da fory not parabolic, (3.164) 
Ty\G/K 
and 
A i 
, | dx dy dx dy 
cele) = lim 4 f E aad o f nen 
y=0 x=} ln aoe 
01 yS 
neEZ—0 
(3.165) 


Here E, is defined by formula (3.156) and T\H means the standard fundamental 
domain for SL(2,Z) =T (as in Fig. 3.14). 
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In order to put the trace formula in final form, Selberg showed how to evaluate the 
various c(y) above in terms of ê or a closely related integral—the Harish transform 
(or Abel transform or horocycle transform) defined by 


alge if a(x-+iy)de = G(logy). (3.166) 


We know from Exercise 3.2.21 of Sect. 3.2 that 
A S 
êls) = MTg (s-3) ; (3.167) 


where M denotes the Mellin transform. 

Now we are ready to prove Selberg’s trace formula for SL(2,Z). It should be 
compared with the result of Exercise 1.3.11 which gives a trace formula on R? / 7? 
and with the finite analogue stated in Sect. 3.7.8. 


Theorem 3.7.4 (Selberg’s Trace Formula). Suppose that g : H —> C is in 
C>(K\G/K); i.e., g is a compactly supported infinitely differentiable K-invariant 
function on H. Then,using the notation (3.157), (3.160), and (3.166), we have 


aa (; r) 
(Sn) = ——— ê| —+ ir) rtanh(zr) dr 
2a n) te Jans (xr) 
logN 
$ Glog) 
{m} k>1 NY -NH 
primitive 
hyperbolic 


+f (j + 7] & att ar 
—G(0)log(2z) + — TA (5 +ir) e 2ir) dr. 


Proof. We begin with formulas (3.163)—(3.165). It remains to evaluate the various 
orbital integrals in terms of the Helgason and Harish transforms of g. 


Central Term. We seek to prove 


area(I\H) 
4r reR 


C(I) = al; + ir) rtanh (zr) dr. (3.168) 


It follows immediately from the definitions that 


c= [ „8 Ha)da = g(1) area (T\H). 


3.7 Harmonic Analysis on the Fundamental Domain 353 
The inversion formula for the Helgason transform of a K-invariant function 
[formula (3.34) completes the proof of formula (3.168)]. 


Hyperbolic Term. Suppose that y = Y, where % = a primitive hyperbolic in T. 
Then we want to prove 


| 
BN __G(klogNy). (3.169) 


cg(y) = =K/2 


NY -NY 


Use Exercise 3.7.19 to see that if y is hyperbolic then 
ry,={ % | keZ} with W% = primitive hyperbolic. 


Then replace y by EyE~!, é € G, with EyE—! diagonal. Also replace T by EVE~!. 
Thus, in our computation of cg(7), we can take 


a 0 


Ty= {| ke2}, w= (4 fa)” a = Ny >I. (3.170) 


We will need the following exercise. 


Exercise 3.7.22. Show that if Ty is given by Eq. (3.170), then we can choose a 
fundamental domain for I'y\H to be 


{z€H|l<y< Ny}. 


As usual, we identify z in H with ayy € SL(2,R) via 


' 1 
z=x+iyeH E i iy p] ESOR). (3.171) 
Therefore, if y = ae = oe” , with a > 1, we have 
0 I/a 


cg(y) = [ yp Stan aay) dx dy. 
Y 


Next note that 


1 _ (1-x/y\ (a 0 1 x/y 
acs (4 1/\oa*/\o 1/° 


So we can make the change of variables x > u = x/y, du = dx/y to obtain 
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=NH 1 —u a 0 lu L1 
cal) te (C “) 6 E E J EP 
= logo f g (a*(i+u) —n) du. 


Set w = (a** — 1)u to find that formula (3.169) is indeed correct. 


Elliptic Term. If yọ is an elliptic element of I with Tryg = 2cos@, then 0 = 7 /2 
or 2/3. Let mo be the order of the centralizer Tyo = Tyo / {+1}. Then mz/2 = 2 and 
My /3 = 3. We shall prove that 


cosh[(z—20)r] . ( 1 


1 
ir) dr. 3.172 
nma cosh (7r) : +) í i i 


cg(Yo) = 


Discussions of this term appear in Hejhal [261, Vol. I, pp. 351 ff] and Kubota [375, 
pp. 99 ff]. Lang [389, pp. 166-167] gives relations between elliptic and hyperbolic 
orbital integrals, which may be of use in simplifying the proof. 

Upon replacing yọ by a conjugate, we can assume that 


_ cos@ sin@ cK 
Me —sin@ cos 
Then we use geodesic polar coordinates from Sect. 3.1 on the orbital integral, plus 
the fact that K = SO(2) is abelian, to see that (if m = me) 


1 — 
Cg(Ye) = mn eet lyx) dx 
2 —v/2 
-= g(a_yYeay) sinhv dv, ay = T 2) l 
e 


m Jy>0 0 


Next we want to make use of Fourier inversion on G/K [see formulas 
(3.32)-(3.34)] which says that 


1 1 
pa. d ee 
g(ke Pi) = tn [8 +ir)P_1, ;,(coshp) rtanh(zr) dr. 


In order to plug this into the elliptic term and evaluate the result, we must remark 
that if a_, yea, = ki a,(9)k2, with k; € K and ay, ay(g) as above, then 


cosh v(@) = (1 — 2sin? 0) + 2sin? @ cosh? v. (3.173) 


It follows that the proof of Eq. (3.172) will be completed by the following exercise. 
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Exercise 3.7.23. Show that if v(@) is defined by Eq. (3.173), then 


cosh[r(a — 20)| 
2| sin @|rsinh (zr) 


P_1,,,(coshv(@)) sinhv dv = 


v>0 


Hints (see Sneddon [609, pp. 384-387] or Erdélyi et al. [165, Vol. IT, p. 330]). 
Make the change of variables from v to w = coshv(@). Then you must compute the 
Mehler—Fock transform 


I (w— cos20)""/?P_ 1 ,,(w) dw, assuming cos(20) > 0. 

wrl 

To compute such a Mehler—Fock transform, you can use the following formulas. 
(a) 


f g(t)P_1,;,(cosha) tanh (zt) dt = nef (cosht — cosha)~!/?F g(t)dt, 
0 


a 


where 


Pps g f ” g(v) sin (tv)dv. 


(b) When g(v) =cosh(bv)/sinh(zv), —1 <b < r, we have 


1 sinht 
,/2n cosht-+cosb 


Fg(t) = 


(c) 


a sinht dt _ T 
a (cosht+cosb)ycosht—cosha vcosha+ cosb 


Parabolic Term. We must show that 


/ 


Ce(ce) = —G(0) log(2z) + L f2 G + ir) (2i) dr. (3.174) 


Begin by noting that conjugacy classes of parabolic elements of PSL(2,Z) are 
represented by y,(z) =z +n, n € Z, with centralizer 


Ty, ={Y%m | MEZ}. 


Clearly 
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1 
MAH = fb bs g.yoh. 


Then we define, as in Eq. (3.165), cg(e) = limg_,..{T) (A) — To(A)}, where 


Ap In dx dy 
nA) = [ | («3 ( Ja ) , 
1( ) y=0 18 Wy 01 x,y y 

1 


. dxd 
n(A) = = I 8(s)Es(z)Es(z) ds > (3.175) 
Res=4 
w+y?>1 
Ixl<4, y<A 


rare lx y20 
os 01 0 y2 ` 


We shall follow Selberg’s Göttingen lectures [573, p. 79]. This method should 
be compared with that of Kubota [375, p. 102 ff]. One should also consider the 
method of Warner [718]. See the article of Gelbart and Jacquet in the Corvallis 
conference proceedings (Borel and Casselman [52, Vol. I, p. 245]) for an expression 
for a parabolic (or nilpotent) orbital integral as a limit of hyperbolic orbital integrals. 
Barbasch [23,24] pursues this matter for general Lie groups. 


Let us start with T; (A). Note that 
ln y? 0\ /1 n/y 
O1/\o yz} \o 1)’ 


1 
fin “2 0 
a, e a ax,y = 7 y2 


Tı(A)= J g (i+ *) = = X, gli+nw)dw, 


Thus 


yaa” w=1/a"re 
setting w = 1/y. It follows that 
oo Au 1 
Tı (A) = 2f g(it+u) © = du. (3.176) 
0 n=l"! 


Formula (3.176) is a reasonable one itself, but we want to express everything in 
terms of G or g. So we make use of the following exercise. 


Exercise 3.7.24. Show that 


1 1 
5 ~=logs+7+0 (=) as x— o, 
0<n<x 
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where y =Euler’s constant = 0.57721566.... 


Hint. This can be proved using the Euler-Maclaurin summation formula (see 
Edwards [149, p. 285]). 


It follows from Exercise 3.7.24 and formula (3.176) that 
Tı (A) = (logA + y)G( 0+2 f logu g(itu)du+o(1) asA—o. (3.177) 


To complete the evaluation of T (A), we need another exercise. 


Exercise 3.7.25. Show that 


2 f logu g(it+u) du = —(log2+ y)G(0) 
0 


Hint (See Selberg [573]). 
Step 1. Let g(axy) = hg (Tr(‘ax.yaxy) — 2), if axy is as in formula (3.175); i.e., 


g(x+iy) = hg (-2+ : +) : 
Next let 
v)= f hwe du 
Then by Exercise 3.2.21 of Sect. 3.2, we have 
hg( =z. H; (v+ w? /2) dw 


We can relate H, and the Harish transform (3.166) of g as in Exercise 3.2.21 of 
Sect. 3.2: 


1 1 14x 
eyo) G(lo -—| hy (9-24 ) ax. 
aie y ) = G(logy) Pla Y 7 


Step 2. So we must deal with 


oo 1 i 5 w2 
a logu hg(u *)du=2 f ogu (= [an (« +3) aw) du. 


Set u? +w?/2 =v, and the integral becomes 
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dv du 


2 Í = 
—— logu H’! (v : 
nvV/2 Jo : v>u? al ee 
Next reverse the order of integration and obtain 


2 2 vv du dv 
E E H! » f lo u ——.. 
nvV2 Jv=0 a”) ae Vv=u? 


Then set u = x/v to turn the integral into 


dx dv 


2 ee 1 _ 
—— H! (v I logy v +logx 
n/2 v=0 al ) a gv E ae 


: a H'(v) logy dv —log2 G(0) 
= = v)logvav—1o re 
2/2 vo S j E 


For the last equality, you can use Gradshteyn and Ryzhik [228, formula (3.452) on 
p. 335]. 
Step 3. Note that 


fiogu H; (u)du = f oge te™ — 2) V2 G' (x) dx 
0 0 
= vaf xG'(x)dx-+2V2 | log(1 — e™*)G' (x) dx 


sya G(x)dx +2V3 f log -e'a dx 


x=0 
—1 1 2 
= —ĝ| = gefi 1 — e™)G' (x) dx. 
e(a) +297 oea- eoo) as 
Step 4. 
f log( = eG! x)dx= 7G(0) + 5 f slarl Ga 
x=0 20 reR 2 T 


To see this, use 


and 
oo , T” 
irf e log(1—e*)dx = FC —ir)+ y, 
x=0 


from Erdélyi et al. [165, Vol. I, p. 316]. This will complete Exercise 3.7.25. 
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Finally we can use formula (3.177) and Exercise 3.7.25 to obtain 


A l 1 1 stl a : 
T,(A)= =G(0)log5 +78 3 z bad zË 7p (l- ir) dr+o(1), as A> œ. 
(3.178) 


Now we turn to the second term in cg(ce) and use Exercise 3.5.12 of Sect. 3.5 to 
show that as A — œ, we have 


/ 5 2ir s —2ir 
RA) =p J g) (21084 ZA a =o Ja ) ar+ott), 


reR 
s=htir 
(3.179) 
where 


p(s) =A(1—s)/A(s) and A(s) = n ™“T(s)¢ (2s). 
To evaluate this, we need the following exercise. 


Exercise 3.7.26. Show that 


+ / as) (eet on™) dr= La (5) 9 G) +o(1), as A — o, 


reR 
s= 4 +ir 


Note that @ (5) =—-l. 


Hints. You could use the residue theorem as in Selberg [573], or beginning facts 
about Fourier integrals (see Sect. 1.2 and Kubota [375]). 


It follows from Eq. (3.179) and Exercise 3.7.26 that 


na = ES f ERRAT z/ i area 
ON One dee KO re es ala 


s= ltir 


+o(1), as A, 


So this combines with Eqs. (3.178) and (3.165) plus (3.175) to give 


ip ft AGEs ox o i 
e) =-G(O)log2— = | el ir) (Fu ir) 20 G H ir) ) dr 


(3.180) 
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Marie-France Vignéras [700] has simplified formula (3.180) as in the following 
exercise. 


Exercise 3.7.27. Use the functional equation of Riemann’s zeta function to 
show that 


1 


cle) = —G(0)log2n-+ 1 [8 G +ir) (-2ir a 


This completes the proof of formula (3.174) for the parabolic term. Combining 
formulas (3.168), (3.169), (3.172), and (3.174) gives the formula in Theorem 3.7.4 


and completes the proof of Selberg’s trace formula, once one has done Exer- 
cise 3.7.29. 


Our discussion of the parabolic term of the trace formula was very indirect. 
It would be nice to find a simplification. In particular, all the cancellations that 
occurred during the computation of the parabolic term make one suspect that there 
is a more direct route to the result given in Exercises 3.7.24—3.7.27. 


Exercise 3.7.28. Show that 
o' 
— (s) 
| (O 


Hint. See Titchmarsh [680, pp. 50-53]. 


1 
<Clog(2+ |r|) for s= zti rER. 


Exercise 3.7.29. Show that |@(5+ir)| <C(1+|r|)~?-€, for € > 0, suffices to 
make the Selberg trace formula valid. 


Selberg’s trace formula is surprisingly similar to Weil’s explicit formulas relating 
sums over zeros of Riemann’s zeta function to sums over the primes (as well as 
analogues for Hecke L-functions of algebraic number fields). See Cartier and Hejhal 
[78], Goldfeld [217], Lang [388], Murty [488], and Weil [729, Vol. II, pp. 48—61] for 
discussions of Weil’s result, which was used by Weil to obtain a positivity condition 
equivalent to the Riemann hypothesis for Hecke L-functions of number fields. 


3.7.5 Applications of Selberg’s Trace Formula 


Our first application of the trace formula is a non-Euclidean analogue of Weyl’s 
result on the asymptotic distribution of the eigenvalues of the Laplace operator (see 
Theorem 1.3.5 of Sect. 1.3). 


Theorem 3.7.5 (Asymptotics of the Eigenvalues of A on L?(T\H)). Set 


N(x) = X dimSN(SL(2,Z),A), 


|A|<x 
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where SN (T,A) is the space of Maass waveforms which are cusp forms, as defined 
in Sect. 3.5. Then 


x area(I\H) 


ms) 4r 


X as X 4°, 


Proof. We use the same sort of argument that worked in the Euclidean case (see 
Theorem 1.3.5 of Sect. 1.3). Substitute 


in Selberg’s trace formula. Then g;(z) is the fundamental solution of the non- 
Euclidean heat equation on H (see formulas (3.37) and (3.38) and note that we 
have changed the notation slightly). 

The left-hand side of the Selberg trace formula in Theorem 3.7.4 for g = g; is 


Or(t) = $, exp(Ant), 


n>0 


where A, runs through the discrete spectrum of A on SL(2,Z)\H. Like Eq. (3.139) 
the function Õr can be considered to be a non-Euclidean analogue of the theta 
function, as we mentioned in the introduction to this section (see Molchanov [473] 
and Exercise 3.7.16). 


Exercise 3.7.30. Show that the Harish transform of the fundamental solution of the 
heat equation above is 


1 na fl. 1 t x2 
Gix)=55 fe Ê zti A or a a : 


Next we look as the various terms of Selberg’s trace formula (Theorem 3.7.4) 
with g = g;,ast > 0+. 


Identity Term. Since tanh (zr) ~ 1 as r — ©, we have 


AD f opd- (+r) rtanh (zr) dr~ Ten t—>0+. 


27 4rt 


Elliptic Term. 


-(}4+r°)t 


l | 3-3/2 3 ,-mr/3\\ & leja [T (1-372) r 
i 3 (a e EOR ae n 1 3-3/ Ptg 
Jalat e +e oar y e T raze 5 a e r 
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This term is thus negligible compared with the identity term. 
Hyperbolic Term. 


—t/ 


log Ný% ei 
Ny!/2 — Ny !/2 \/4nt 


exp{—(logNy)?/4t} = o(1), t—> 0+. 
{y} hyperbolic 


Exercise 3.7.31. Fill in the details for the stated result on the hyperbolic term. 


Parabolic Terms. Note that G;(0) = (4zt)~!/2e~*/4 is negligible compared with 
the identity term. Then use the following exercise to see that the story is the same 
for the rest of the parabolic term. 


Exercise 3.7.32. Show that 


$-(-2ir) = O(log as |r| — ©. 


Hint. Use the functional equation of € to change —2ir to 1 + 2ir. Then use 
the Euler-Maclaurin summation formula, for example, to bound €(1 + 2ir) (see 
Edwards [149, p. 183]) and Stirling’s formula to bound gamma. 


It follows from the Selberg trace formula and Exercises 3.7.30-—3.7.32 that 


0,(t) ~ area (SL(2,Z)\H)/(4at) as t>0+. 


Theorem 3.7.5 is a consequence of the Tauberian theorem given as Theorem 1.2.5 
of Sect. 1.2. 


It follows from Theorem 3.7.5 that the dimensions of the spaces SN (I, s(s — 1) 
cannot grow too rapidly with the imaginary part of s. This answers a question that 
had worried us in the discussion following Theorem 3.5.4 of Sect. 3.5. Of course, it 
still remains open whether the dimensions can be larger than one. 

It is also possible to work out error terms for the asymptotic result in Theo- 
rem 3.7.5 (see, e.g., Hejhal [261]). These applications remind one of the work of 
analytic number theorists on the distribution of primes (see Davenport [120], for 
example), using the relationship between primes and zeros of the Riemann zeta 
function derived from the Euler product for €(s) and the Weierstrass factorization 
of 55(s— 1)x~/°1'(s/2)C(s) as a product over the nonreal zeros of ¢ (s). This sort 
of analogy motivates much of the work in Hejhal [261,262]. In this trace-formula 
analogue of analytic number theory, primes are replaced by norms of the primitive 
hyperbolic conjugacy classes and the zeros of €(s) are replaced with s such that 
s(s— 1) is an eigenvalue of A from the discrete spectrum. Next we want to reverse 
the direction of the flow of information from that in Theorem 3.7.5 and use the trace 
formula to study the Ny, {y} hyperbolic. 
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Theorem 3.7.6 (Sarnak’s Theorem on the Asymptotics of Units in Real 
Quadratic Fields à la Wallace). Let £4, hg be as in Exercise 3.7.18, part 
(c); ie, the €q are fundamental units in certain orders © (not necessarily 
maximal) in real quadratic fields QV d) and hq is the narrow class number of the 
order ©. Then 


x2 
5 ligne, asx >o. 
eee log(x*) 


Proof. We take our discussion from Dorothy Wallace [711]. It should be compared 
with that in Hejhal [261, Vol. II, p. 519], Sarnak [555,557] and Subia [647]— 
each of whom chooses a different function to plug into Selberg’s trace formula. We 
choose instead the function of formula (3.141): 


fa(z) = exp[—acoshd (i, z)] = exp |- La TW) , 


for z € H, and W, the corresponding positive matrix in SP from formula (3.140). 
Then we have [by formula (3.141)] the Helgason transform 


fals) = 2(27/a)"?K;_; (a). 


And the Harish transform is 


oy al NT 
n30 "7? 
Qn logNyY 1 
~= ¥ exp[—a(Ny+Ny_')|, a — 0+. (3.181) 
1/2 _ Ny-l/2 
a F Ny!/2 — Ny- !/ 
hyperbolic 


Then, we have 


T logN% a 1 
= 2, yn | a \NY+NY \|, a> 0+. 


hyperbolic 
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It follows from Exercise 3.7.18, since the terms from the primitive hyperbolics 
dominate the rest, that 


l 24e? 
~En Og Eq ee ea ), a — 0+, (3.182) 
ez" 


where the sum is over the fundamental units of the theorem. 
Exercise 3.7.33. Prove formulas (3.181) and (3.182). 


Hint. Replacing the sum over all hyperbolic conjugacy classes with that over 
primitive hyperbolic conjugacy classes is a standard argument in analytic number 
theory (see, e.g., Lang [388, p. 159]). 


One must finally make a Tauberian argument to complete the proof of Theo- 
rem 3.7.6. This time the argument is more complicated than that in the proof of 
Theorem 1.2.5 of Sect. 1.2. This argument in given in Exercise 3.7.34. 


Note that by Exercise 3.7.20, Theorem 3.7.6 can also be viewed as the prime 
geodesic theorem, giving the asymptotics of the length spectrum of closed geodesics 
on the Riemann surface SL(2,Z)\H. 


Exercise 3.7.34 (H. Stark). Complete the proof of Theorem 3.7.6 using the fol- 
lowing hints. 


Step 1. The gz" appearing in Eq. (3.182) can be thrown away since £4 — œ as 
d — œ. 
Step 2. To apply Theorem 1.2.5 of Sect. 1.2, set 


log Ed 
a t) = x ha , 
2 Ed 
E d <t 
and obtain 


a(t) ~ 2t!/2 as t approaches infinity. 


Step 3. Change variables via t = v and obtain 


loge 
£ ha BED a 2v as v approaches infinity. 
Eq SV Eq 
Step 4. Given any ô > 0, only the terms x! < g4 < x matter and thus you obtain 
ha 


2 
A(x) = sw a, as x approaches oe. 
eac Ed logx 


Step 5. Use integration by parts to see 


Eq * 2 fs 
hq = h t dA(t 
È a= 2 doa [ ()~ rere ise ee 


Eq<x Eq<x 
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It is interesting to compare the result of Theorem 3.7.6 with a conjecture made 
by Gauss which was later proved by Siegel [600, Vol. II, pp. 473-491]: 
Ș halogea ~ T eh X — o. (3.183) 
dex 18¢ (3) 
In both formulas the ds run over positive integers which are not perfect squares and 
which are congruent to 0 or 1 modulo 4. It does not appear to be possible to use 
either Theorem 3.7.6 or formula (3.183) or both to attack the question of whether 
an infinite number of hq are equal to 1. 
To emphasize the analogies with prime number theory one can define Selberg’s 
zeta function for SL(2,Z) as 


Z(s)=[[[[0 -677 4)" ifRes>1. (3.184) 
k>0 d 
This function has an analytic continuation as a meromorphic function in the whole 
complex s-plane with nontrivial zeros at s such that s(s— 1) is an eigenvalue in the 
discrete spectrum of A on TĦ and at the poles of the function 


p(s) =A(1—s)/A(s), A(s) = 2°77 (s/2)E(s) 


in Re s > 5 (see Gangolli and Warner [195], Hejhal [261], Selberg [569], Venkov 
[693], Marie-France Vignéras [700] and Voros [710]). Vignéras [700] shows that 
the functional equation of Selberg’s zeta function involves the Barnes double gamma 
function, which was used by Shintani [595] in the evaluation of zeta functions of 
totally real number fields at negative integer values. 

Selberg’s zeta function is shown by Elstrodt [155, p. 68] to arise through the 
formation of the trace of RyRy, where R} = (A—A)~! = the resolvent of the 
Laplacian. Thus the Selberg zeta function actually contains as much information 
as the trace formula itself. 

Ruelle [549] discusses various zeta functions from number theory and geometry. 
He notes that Selberg’s zeta function is a special case of a zeta function considered 
by Smale for differentiable flows on compact differentiable manifolds, assuming 
that the fundamental domain I’\H is compact, of course. Graph theory analogues 
of Selberg’s zeta function, known as Ihara zeta functions, are discussed in my book 
[671]. 

There are many other applications of Selberg’s trace formula. For example, one 
can return to the question discussed in the introduction to this section: Can one 
hear the shape of a drum? This is the first of the following questions. The second 
question can be viewed as involving the set of lengths of periodic billiard paths on 
the fundamental domain. The questions are theoretical rather than practical since it 
is unlikely that any human can hear all the overtones of a drum or compute all the 
periodic billiard lengths. Physicists have found the results to be of interest in work 
on quantum chaos. See Gutzwiller [242]. 
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Questions About Vibrating Manifolds 


1. Is the drum determined up to isometry by the spectrum of the Laplacian? 

2. Is the set of lengths of closed geodesics determined by the spectrum of the 
Laplacian? You see the lengths and you hear the eigenvalues. 

3. Is the drum determined (up to isometry) by its length spectrum? 


Some Answers 


1. Marie-France Vignéras [701] shows that there are non-isometric compact 
Riemann surfaces whose Laplacians have the same spectrum. Sunada [649] 
showed that one can obtain isospectral but nonisometric manifolds using the 
trace formula and a bit of group theory that goes back to some work in algebraic 
number theory giving examples of nonisomorphic algebraic number fields with 
the same Dedekind zeta function. We discuss graph theory analogues of this in 
Stark and Terras [633]. See Terras [668] for finite analogues, as well as [671]. 
Another reference for the compact case is Buser [74]. The Buser examples 
ultimately led to the planar drums that cannot be heard. See Carolyn Gordon 
et al. [227]. 

2. Gangolli [191] shows, using the Selberg trace formula, that when I\H is com- 
pact the spectrum of the Laplacian determines the lengths of closed geodesics. 
Huber [307] proved that the Laplace spectra of two compact Riemann surfaces 
are the same if and only if the surfaces have the same length spectra. See Buser 
[74, Chap. 9]. 

3. Wolpert [744,745] showed that a generic compact Riemann surface is determined 
up to isometry by its length or Laplace spectrum. See Buser [74, Chap. 10]. 


Selberg also used the trace formula to compute dimensions of spaces of 
holomorphic cusp forms (see Selberg [569] and Hejhal [261]). This method has 
been used with success for higher-rank groups (see Volume II or [667]). It is closely 
connected with Hirzebruch’s generalization of the Riemann—Roch theorem as well 
as the Atiyah—Singer index theorem. However, there does not exist a formula for the 
dimensions of spaces of cuspidal Maass waveforms of the type we have considered. 

Selberg [569] and Eichler [152] use the trace formula to obtain a formula 
involving class numbers of positive definite binary quadratic forms for the traces 
of Hecke operators (see Hejhal [261] and Lang [391], the chapter by Zagier on the 
trace formula, with correction in Zagier [753]). This version of the trace formula is 
called the Eichler—Selberg trace formula. 

The trace formula is intrinsic to work on the Langlands program. Of course the 
language of this subject is adelic representation theory which we have avoided here. 
See Bernstein and Gelbart [37]. 

The trace formula appears in much of the work on the Artin conjecture on 
the holomorphy of Artin Z-functions. References are Saito [552] and the article 
of Gérardin and Labesse in the Corvallis conference proceedings (see Borel and 
Casselman [52, Vol. II, pp. 115—134]), as well as the article of Shintani in the same 
conference proceedings [loc. cit., pp. 97—110], and finally the notes of Langlands 
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[394] in which a proof of a new case of Artin’s conjecture is given. See Tunnell 
[686, 687] for an extension of Langlands’ results. 

We consider a final application of the trace formula in Sarnak’s formula (3.185) 
below counting modular knots with given link number with respect to the 
trefoil knot. 


3.7.6 Tables Summarizing the Main Results 


Tables 3.11 and 3.12 plus Fig. 3.46 summarize the main results of this chapter—at 
least those which are easily summarized. We are assuming T = SL(2, Z) throughout 
the tables. These should be compared with the tables in Sect.3.7.8 where H is 
replaced by the finite upper half-plane H,. Here we view IR? as the symmetric space 
R?/ {0} rather than the symmetric space of the Euclidean group M (2,R”) modulo 
the subgroup SO(2). 


3.7.7 Modular Knots 


There is a surprising connection between primitive closed geodesics in SL(2,Z)\H 
and knots coming from the Lorenz nonlinear differential equations introduced in 
1963 as a model of atmospheric convection. These connections were found by Ghys 
[210]. See the beautiful website of Ghys and Leys [211]. For more information on 
knots see Adams [1]. 

First one can identify SL(2,R)/SL(2,Z) with the complement of the trefoil knot 
in R?. The trefoil knot T is the knot pictured in Fig.3.47. To see this, one must 
identify SL(2,R)/SL(2,Z) with the space of lattices L = Z@, + Za, in the plane 
and look at the corresponding Eisenstein series 


g= $ o™ ad gly= Y o™. 
weL—{0} @eL—{0} 


Then (g2,g3) correspond to a lattice iff the discriminant A = g3 — 27g3 4 0. So the 
space of lattices can be identified with C? — {A = 0}. Scale the lattice by a positive 
constant in order to restrict to lattices such that |g2|° + |g3|° = 1. 
t/2 

The flow in SL(2,R)/SL(2,Z) given by & ye ) € SL(2,R) corresponds to 
the geodesic flow on SL(2,Z)\H, where H denotes the upper half-plane. Primitive 
closed geodesics are orbits of this flow. They correspond to primitive hyperbolic 
conjugacy classes in SL(2,Z); i.e., {y} with |Tr(y))| > 2. 

You can thus view the orbits of this flow in SL(2,Z)\H as knots in R? — 7 (the 
complement in 3-space of the trefoil knot). These are now known as modular knots. 
We use the notation ky for the modular knot associated to y € SL(2, Z). For example, 
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Table 3.11 Comparison of Euclidean and non-Euclidean harmonic analysis in two dimensions 


General 


Symmetric space 
X ~G/K 


Euclidean 


x= (2) ER? 
x2 


Euclidean plane 


Non-Euclidean 


z=x+iy€H 
Poincaré upper half-plane 


Arc length ds d’ = dxi + dx d? =y? (dx? + dy”) 
Laplacian A= a+ a= (th) 
G-invariant area d u du = dx\dx2 du =y~ dx dy 
G=isometry group R? g= (° al € SL(2,R) 
c 
K = subgroup ot G K = {0} SO(2), origin=i 
fixing the origin 
Action of g € GonxEX x—> x+ g, vector addition| z —> gz = atb 
T =discrete subgroup of G z? SL(2, Z) 
Elementary eigenfunctions of e,(x) =e?" “@ ps(z) = (Imz)* 
A 
Eigenvalues Aea = —47? al ea Aps =s(s— 1)ps 
hs(2) = J (Im(kz))*dk 
Spherical function (zonal) q(x) X 


Helgason—Fourier transform 


R2 sECkEK 
Fourier inversion or F(x) ; (x) = yr f f Hf (2 + it,k) 
iti SIN tER keK 
spectral decomposition of A xpi ,q(ke)t tanh (mt) dt dk 
2 
Convolution H(f*g)=Hf-Heg, 


defined by convolution on G 


fre= fê, fig EL! (R?) 


f,& €L! (H), for K — invariant g 


Differentiation 
Heat equation 
u= Au, u(x,0) = f(x) 
u(x,t) = f*g, 
g:= fundamental solution 


AFO) = —42? lly? FO) 


g(x) = ne exp(— Ii? C) 


H (Af) (s,k) =5(5— 1)Hf(s,k) 
81 (kei) 


v2 bet? (4) 
(4nt)3/2e/4 J vcoshb—coshr 
P 


Horocycle transform 


No analogue 


F(y) =y" fr f(x+iy)dx 
invertible 


we have the green knot corresponding to the hyperbolic element ( ' ) in Fig. 3.48, 


shown there winding about the trefoil knot. See Ghys and Leys [211] for this and 


more examples. 


Ghys proved (see [210]) that such modular knots are Lorenz knots. The Lorenz 
knots are closed solution curves of the Lorenz equations: 


dx 


d 
= 28x — y— xz, 


dt 


dy 
= 10(y—x), a 


dt 
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KIT x—> x + (0, 1) 
Fundamental domain 

polygon with sides (0, 1) C154) 
identified 


x—> x + (1, 0) 


Jec (x) 


Draw an example 
with support in the 
fundamental domain 


Graph of f Graph of f 


g(x) = 


È frx) 


yer 


P-periodization of f 


Graph of g Graph of g 


Fig. 3.46 I-periodizations of compactly supported functions on H 


Integrate these ODEs and obtain orbits that give a flow u(t),t € R. These solution 
orbits stay in the neighborhood of a fractal region shaped like a butterfly (called a 
strange attractor). A Lorenz knot is a periodic orbit of the Lorenz flow. These knots 
were studied by Joan Birman and Robert Williams in 1983 (see [41]). In particular, 
they found the link types of these knots. It is hard to compute such things thanks to 
the chaotic nature of the Lorenz equations and the butterfly effect, which means that 
a slight change in initial condition can make a huge change in the solution. 

To define the linking number of two oriented nonintersecting knots, we need 
Fig. 3.49. Project the knots onto the plane. At the points where the purple knot 
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Fig. 3.47 The trefoil knot. T 


Fig. 3.48 The modular knot associated with y = 
[211] shown linked with the trefoil knot (blue) 


3) (pictured in green) from Ghys and Leys 


3.7 Harmonic Analysis on the Fundamental Domain 373 


Fig. 3.49 Computing the 
linking number of the purple 
knot with the turquoise knot 


crosses over the turquoise one, you get two types of picture. If you see the top picture 
in Fig. 3.49, you get +1. In this case, when you rotate the knot that goes under; i.e., 
the turquoise knot, clockwise so that it lines up with the purple knot, then the arrows 
line up. If you see the bottom picture in Fig. 3.49, you get — 1. In this case, you need 
to rotate the turquoise knot in a counterclockwise direction to get the arrows to 
line up. Sum these numbers over all crossings of the purple knot over the turquoise 
knot and get the linking number denoted link(purpleknot, turquoiseknot). Adams 
[1] shows on p. 20 that it does not matter what projection you choose to compute 
the linking number. He also shows that this number can be 0 even though the knots 
cannot be pulled apart. 

Now comes the amazing part. Since the modular knots can be viewed as 
occurring in the complement of the trefoil knot in the unit ball in R?, it is natural 
to want to compute the linking numbers of these modular knots with the trefoil 
knot using modular forms somehow. Ghys proved that the linking number of a 
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modular knot with the trefoil knot is the Rademacher function ¥ : PSL(2,Z) = 
SL(2,Z) /{41} > Z, which comes out of the multiplier system for the Dedekind eta 
function; i.e., 


link(ky, T) = ¥ (y). 


Here the Rademacher function ¥ (y) comes from the transformation formula for the 
eta function: 


n(y2) = v; (Y)(cz+d)?n(z), forall y€ SL(2,Z), 


v) = e™Y0)/12 when Tr(y) > 0. 
The computation of the Rademacher function can be done by writing y € SL(2,Z) 


1 0 10 
Y(y) = e1 +: --+e,. See Rademacher and Grosswald [527, pp. 45-58] for proofs of 
this and much more information on the Rademacher function and its connection to 
Dedekind sums. 


as a product of U = ry and V = é a): If y= UV*!.--UV, then 


Exercise. Compute link(ky, T) for y = ( i ») 


Sarnak [559] uses the Selberg trace formula to show that 


Hon as x—> o. (3.185) 


primitive hyperbolic in SL(2,Z x 
Tr(y) <x, and link(ky,t) =n 12 (logx)’ 


He also shows a more precise version of this formula which implies that the most 
common linking number between a modular knot and the trefoil knot is 0. 

One wonders whether other topological invariants of modular knots and links can 
be computed using modular forms somehow. 


3.7.8 Finite Analogue of the Selberg Trace Formula 


It is hard to resist seeking a trace formula for some subgroup T of GL(2,F,) acting 
on the finite upper half-plane H,, if q = p”, for an odd prime p and r > 1. We did 
this in [668] for T = GL(2,F p). Shaheen [583,584] considered the trace formula 
for various other subgroups I of G. Here we shall only state the trace formula for 
T = GL(2,F p) acting on H „2. 

In our first table we summarize what we know from our earlier section on 
finite upper half-planes and finite upper half-plane graphs. We found in our book 
[668] that it was necessary to know the irreducible unitary representations of G = 
GL(2,F,), q4 = p”, p an odd prime, since not only principal series but also discrete 
series representations arise in studying the eigenvalues of the adjacency operators 
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Table 3.13 Finite upper 


hal planes Finite upper half-plane 


Space = H4, q odd 

= {z=x+ V ôy | x,y €Fy,y # 0} 
where 6 £a’, for any a € Fy 

Group G=GL(2,F,) 


a (CP) eseaa 


Group action gz = “77, z € Hy 
Origin = Vô 
Subgroup K fixing origin 


ay) cal = F,(v5)* 


Pseudo-distance d(z,w) = Liem 
Im(x+yv6) =y, N@+yvV6) =x -y5 
Graph Xa = X,(6,a), vertices z € Hy 
edge between z and wif d(z,w) =a 


Aa = adjacency operator for X, 
Aa =Aa— (q+ DEL if a #0 or 46 


Spherical function 
hr (2) = qey Èx An (kz), n € Ô“, Xx = T(z) 
i.e., 7 occurs in Ind€1, dr = deg 


Spherical transform 

of f : K\G/K — C 

f(x) = Eeen, f(z)hn (2) 
Inversion 

FO) = yoy Erce dn f(T)hr (2) 
Horocycle transform 

F(y) = Erer, f(x +y vô) 


not invertible 


on finite upper half-plane graphs. We need the discrete series representations 
again in our study of the trace formula. The notation p = Ind§1 means the 
induced representation obtained by inducing the trivial representation on K up to 
the representation p on G by setting (p(g)f) (x) = f(gx), for all x,g € G. Here 
fEV={f:G—>C |f(kx) = f(x), vk € K,x € G}. The notation GX means the 
irreducible unitary representations of G that occur in Ind§1. 

The details of the computations for the following two tables are to be found 
in [668]. Table 3.13 summarizes the basic facts about finite upper half-planes and 
their associated graphs. Table 3.14 summarizes the trace formula for the subgroup 
T = GL(2,F,), p an odd prime acting on H2. These tables should be compared 
with the tables in Sect. 3.7.6. 

It is also possible to consider p-adic symmetric spaces, which are p + 1-regular 
trees; i.e., graphs that have no cycles. This is done in my book [668, Chap. 24] plus 
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Table 3.14 Selberg trace 


T = GL(2,F,); T =GL(2,F,), q= p? 
formula on finite upper C G= CL, 7) GL(2,F,), q4=p 


half-plane Conjugacy classes in I 
Central G „ac Fs 
Hyperbolic ( ; n) „abe F; 
Parabolic i 1) acF, 
= P 


Selberg trace formula (q = p’) 
x f(a)mult (1, Ind@1) 


meGk 
=|T\G\(p— 1) f(/5) + SEY E Fe) 
ceF, 
cA#l 
449-1) F(1)-f VE + 2! 5 F a+bn ] 
EO- + SS, E 
b40 


where n? = 6, n € Fy 


the papers with Dorothy Wallace [672] and with Horton and Newland [304]. Tables 
comparing the tree with the upper half-plane and the finite upper half-plane are to 
be found in my paper [669]. The trace formula on a tree has many applications; e.g., 
to obtain the Ihara determinant formula for the Ihara zeta function of a finite regular 
graph. This is discussed in [668, Chap. 24] and [304]. 

Another version of the trace formula on finite upper half-planes was found by 
Shaheen in [583, 584]. Shaheen follows the methods of Hejhal [261, Vol. I], 
avoiding all mention of group representations. 
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Weil, 236 
conjugacy class, 83, 344-346, 352, 355, 362, 
364, 367, 376 
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eigenvalues, 23, 36, 39, 45, 54, 56, 58, 59, 88, 
89, 92, 93, 96, 112, 114, 115, 119, 
120, 126, 131, 138, 142, 155, 224, 
237, 268, 273, 275, 276, 278, 280, 
281, 283, 284, 286, 287, 317, 320, 
329, 334, 346, 347, 362, 365 
E,, nonholomorphic Eisenstein series, 
260-263, 267, 269, 271, 284, 
312-314, 320, 321, 323, 325, 333, 
336, 340-342, 344, 356 
Eisenstein series, 66, 82, 229, 230, 234, 236, 
240, 253-255, 259-264, 267-269, 
271, 272, 285, 288, 295, 298, 304, 
312, 313, 316, 320,321,323, 325, 
339 
electromagnetic spectrum, 48, 156 
elementary divisor theory, 72, 79 
elementary row and column operations, 79 
elliptic 
curve, 209, 235, 237, 242, 250-252, 265, 
267, 284, 300, 301 
element, 214, 317, 345, 354, 361 
fractional linear transformation, 204, 213 
function, 235, 247—250, 257, 258 
geometry, 112, 147, 149, 153 
integral, 200, 235, 247, 248 
partial differential equation, 259 
point, 213, 214, 240 
elliptic fractional linear transformation, 213 
energy level, 48, 58, 89, 119 
Epstein zeta function, 33, 44, 58, 63, 64, 67, 69, 
73, 76, 79, 82, 86-88, 229, 259, 260, 
262, 273, 283, 292, 325, 339, 341 
equation of degree 5, 202 
equivalent 
geodesics, 204 
matrices, 163, 309 
points, 203, 213, 260, 326 
positive definite matrices, 243 
representation, 126—128 
Riemann surfaces, 209 
ergodic theory, 36, 104, 280, 349, 350 
Escher, 198 
n, Dedekind eta function, 230, 238, 264, 374 
Euclid’s fifth postulate, 112, 152 
Euclid’s second postulate, 112 
Euclidean algorithm, 204 
Euclidean group, 83, 90, 136, 148, 197, 367 
Euler 
—Maclaurin summation formula, 362 
-Lagrange equation, 110 
angle decomposition, 116 
constant, 264, 357 
formula for ¢(2n), 82, 265 
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formula for T (s), 5, 292 
formula for genus, 214 
function, 75 
Lagrange equation, 110 
product, 63, 75, 76, 302, 304, 305, 307, 311, 
362 
even integral positive matrix, 243, 256 
Ewald’s method of theta functions, 87 
excited state, 120 
expectation or mean, 26, 191, 192 


F 
Farey fractions, 315 
fast Fourier transform or FFT, 31, 46, 90, 94 
Féjer kernel, 3, 34 
Feynman integrals, 319 
Fibonacci tiling, 98 
finite analogue of Euclidean distance, 90 
finite Dirichlet polygon, 225 
finite Eisenstein series, 291 
finite element method, 54 
finite Euclidean graph, 91 
finite Euclidean space, 90 
finite fundamental domain, 289 
finite general linear group, 222, 292, 374 
finite geodesic, 222 
finite horocycle, 222 
finite non-Euclidean distance, 222 
finite rotation group, 222 
finite simple group, 242 
finite symmetric space, 90 
finite tessellation, 225 
finite trace formula, 374, 376 
finite upper half-plane, 221, 223, 227, 289, 374 
finite upper half-plane graph, 223 
Fischer-Griess monster group, 242 
Fourier analysis on symmetric space, 9, 117, 
178 
Fourier analysis on the fundamental domain, 
34, 39, 333, 352 
Fourier coefficient, 31, 40 
Fourier inversion, see inversion of a transform 
Fourier series, 3, 4, 6, 13, 23, 31-37, 39-41, 43, 
44, 86, 94, 101, 118, 130, 176, 180, 
196, 228, 239, 257, 258, 262, 269, 
320, 325 
generalized, 54 
non-Euclidean, 316, 342 
Fourier transform, 9-12, 14-19, 21, 23, 25, 
26, 28, 29, 31, 36, 40, 42, 43, 46, 
49-51, 59, 61, 65, 96, 98, 99, 101, 
111, 133-135, 137, 143, 144, 148, 
176, 190, 196, 332, 338 
on Z/nZ, 94 
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Fourier-Bessel series, 141 
fractal, 104—106 
fractional ideal, 73, 80 
fractional linear transformation, 68, 150-153, 
150, 203, 204, 210, 212, 213,222, 
327, 343, 344 
Frobenius reciprocity law, 128 
Fubini theorem, 12 
Fuchsian group, 210, 211,217, 254, 319 
functional equation, 64, 66, 80, 88, 135, 140, 
141, 167, 169, 172, 175, 179, 262, 
274, 293-296, 298-300, 321, 324, 
325, 360, 362, 365 
fundamental domain, 31, 147, 197—199, 
202-205, 207-216, 218-221, 
225, 227, 228, 231-233, 240, 241, 
254, 260, 265, 268-273, 275, 278, 
280, 286, 310, 312-314, 316, 319, 
320, 326, 333, 337, 339, 343, 346, 
348-351, 353, 365 
fundamental function or Hauptmodul, 240 
fundamental or Poincaré group, 108 
fundamental solution 
heat equation, 3, 18, 29, 133, 186-191, 193, 
195, 342, 361 
Laplace equation or Green’s function, 5 
wave equation, 19 
Schrödinger wave equation, 319 
fundamental unit, 73—76, 78, 79, 81, 82, 334, 
346, 347, 363, 364 
Funk-Hecke theorem, 132, 133 


G 
T(N), 211 
To(N), 200, 244 
Gauss 
-Bonnet formula, 112, 153, 265 
distribution, 133, 192, 237 
hypergeometric function, 173-175, 339 
kernel, 3, 10, 14, 27-29, 189 
sum, 267, 289, 290, 299 
Gelfand criterion, 89 
Gelfand pair, 89 
general linear group, 62, 74, 76, 79, 83, 125, 
130, 196, 223-225, 227, 243, 244, 
286, 289, 291, 300, 334, 375 
generators of groups, 204, 211,220, 299 
generators of quaternion algebra, 219 
genus, 209, 214, 215, 221, 247, 334 
geodesic, 111, 112, 149, 151-153, 156, 208, 
209, 215, 254, 280, 315-317, 334, 
346-349, 364, 366, 367 
-reversing isometry, 89, 154 
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flow, 280, 350, 367 
polar coordinates, 154, 155, 171, 173, 174, 
187, 354 
polygon, 204 
Gibbs phenomenon, 34, 49, 118 
Girard formula, 112 
golden ratio, 97 
Green’s 
function or resolvent kernel, 5, 56—58, 60, 
61, 118, 131, 138-142, 254, 327, 
333, 337-342 
theorem, 5, 115, 167, 187, 268, 270, 272 
Grenzkreisgruppe, 210 
grossencharacter, 82, 268, 298, 300 
group representation, 20, 48, 92, 125-131, 163, 
203, 224, 225, 227, 276, 286, 300, 
317, 366, 376 


H 
Haar measure, 66, 127, 128, 130, 132, 133, 
181, 327, 328, 330 
Hamming distance, 252 
Hankel 
inversion formula, 137, 141, 169 
Hankel function, 140 
Harish or Abel or horocycle transform, 182, 
352, 357, 361, 363 
Harish transform, 352 
harmonic analysis, see Fourier analysis 
harmonic function, 170, 270 
harmonic polynomial, 114 
Hasse-Weil zeta function, 301 
Hauptkreisgruppe, 210 
Hauptmodul or fundamental function, 240 
Hausdorff dimension, 106 
heat equation, 3, 13, 18, 23, 29, 133, 164, 
186-191, 193, 195, 317, 342, 361 
heat kernel, 164, 165, 186, 193 
Heaviside step function, 4, 18 
Hecke 
algebra, 225 
Bochner-Hecke formula, see Bochner- 
Hecke formula 
correspondence, 292, 294, 299 
Funk-Hecke theorem, see Funk-Hecke 
theorem 
integral formula, 77,79, 81, 82 
L-function, 82, 268, 284, 298, 300, 360 
operator, 276, 278, 284, 285, 294, 303-306, 
308, 312, 314, 315, 317, 366 
points, 278 
triangle group, 216 
Heisenberg uncertainty principle, 21 
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Helgason transform, 178, 180, 182, 183, 187, 
328, 329, 333, 335, 344, 353, 363 
Hermite polynomials, 59 
hidden periodicities, 48, 50 
highest point method, 203 
highest weight, 128 
Hilbert transform, 144 
Hilbert’s 12th and 18th problems, xi, 70, 74, 
82, 197, 202 
Hilbert-Schmidt operator, 53 
Hilbert-Schmidt operator, 53, 54, 57, 58 
Hgjendal method for Madelung constant, 86 
homogeneous space, 108, 153, 154 
homology, 315 
horocycle, 181, 182, 280, 312, 314, 316, 348, 
349, 352 
Huyghen’s principle, 20 
hydrogen, 118, 120, 121, 123, 196, 320 
hyperbolic 
3-space, 189 
element of SL(2, R), 254, 255, 341, 
345-348, 353, 362, 364 
fractional linear transformation, 213 
geometry, 149, 152, 153, 156, 208 
group, 215 
triangle, 153 
upper half-plane, H, 150 
hyperbolic fractional linear transformation, 
213 
hyperfunction, 178 
hypergeometric function, see Gauss or 
confluent hypergeometric function 


I 

I,, see Bessel function 

ideal, 72, 73, 75-77, 79-81, 202, 205, 207, 345 

ideal class group, 73, 77, 208, 241 

images, method of, 44, 57, 58, 60, 337, 342 

impedance, 157, 160 

incomplete gamma function, 62, 64, 65, 67—69, 
81, 88, 293, 296, 298 

incomplete theta series, 321-324 

independent random variables, 7, 25-29, 190, 
192, 193, 195, 237 

indicator function of a set, 195, 314 

induced representation, 128, 224, 375 

Infeld-Hull factorization method, 58 

instrument function, 49 

integral basis, 72 

integral test, 33, 64, 340 

interferogram function, 48 

intertwining operator, 126 
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invariant differential operator, 66, 111, 153, 
178 
invariant integral, 127, 128, 328 
invariant random variable under rotation, 
189-192, 195 
inversion in a sphere, 57, 249 
inversion of a transform 
Fourier, 10-12, 14, 16, 18, 20, 26, 36, 40, 
41, 49, 137, 180 
Fourier on Lie group, 129, 196 
Fourier on symmetric space, 175, 354 
Hankel, 137 
Helgason, 178, 180, 183, 353 
Kontorovich-Lebedev, 142, 168, 175, 176 
Laplace, 22, 23 
Mehler-Fock, 175-177, 181 
Mellin, 61, 62, 169, 183, 293, 294, 297, 
321, 324 
Radon, 142, 144 
irreducible representation, 126, 128, 131, 287 
isometric circle, 210 
isometric Riemann surface, 366 
Iwasawa decomposition, 331 


J 

Jy, see Bessel function 

Jacobi derivative formula, 247 

Jacobi identity for A, 238 

Jacobi theta function, 246, 253 

Jacobi transformation formula for theta 
function, 44 

Jacobi triple product formula, 250 

Jacobi-Abel functions, 247 

Jacobian elliptic function, 248, 249 

Jordan form, 212, 344, 345 


K 

K, see maximal compact subgroup 

K,, see Bessel function 

K bi-invariant function, 224, 328 

K-invariant function, 181—183, 189, 190, 
192-195, 343, 344, 352, 353 

K-theory, 82 

kernel of integral operator, 3, 4, 33, 34, 41, 53, 
54, 56,57, 169, 175, 288, 327, 337 

Kirchoff’s formula for wave equation, 20 

Kleinian group, 210 

Kloosterman sum, 93, 254, 290, 313 

Kodaira—Titchmarsh formula (Stieltjes-Stone 
also), 137 

Kodaira-Titchmarsh formula (Stieltjes-Stone 
also), 138, 140, 327 
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Kontorovich-Lebedev transform, 142, 164, 
168-171, 175-177 

Korteweg—DeVries equation, 257 

Korteweg-DeVries equation, 258 

Kronecker limit formula, 264, 265 

Kronecker symbol, 273 

Kronecker theorem, 101 


L 
L-function, 72, 74, 75, 81-83, 227, 237, 265, 
268, 273, 274, 283, 284, 299-301, 
310, 317, 360, 366 
A, Laplace operator, 110-112, 114, 115, 
117-119, 133, 136 
A, Laplace operator, 5, 18, 31, 36-38, 45, 
56-58, 60, 92, 155, 164, 167, 170, 
173, 184, 185, 187, 258-260, 263, 
267, 270, 272, 273, 280, 283, 286, 
288, 297, 317, 318, 320, 321, 323, 
326, 328, 329, 331,333, 337,338, 
340-342, 347, 360, 362, 365, 366 
A, Laplace operator, 153 
Laplace series of spherical harmonics, 118 
lattice space, 367 
least squares, method of, 51 
Lebesgue 
dominated convergence theorem, 6, 11, 12, 
188, 195 
integrable function, 2, 15, 143, 349 
integrable functions, L! (Xh T 11, 12532; 
127 
integral, 11, 13 
measurable function, 139 
measure, 2, 25, 101, 322 
square integrable functions, L? (X), 11 
left G-action, 154 
Legendre function 
P;, 113, 114, 172-177, 191, 192, 224, 329, 
338 
Qs, 338, 339 
Legendre symbol, 244 
lemniscate integral, 200 
length spectrum of Riemannian manifold, 346, 
347, 364, 366 
length standard, 46 
level of congruence subgroup, 211, 215, 244, 
247, 286, 301, 306 
Lie group, 89, 123, 125-129, 150, 168, 189, 
196, 199, 203, 242, 244, 250, 253, 
322, 356 
limit point of a discrete group, 210 
limit-point and limit-circle case in ODE, 140 
linking number of knots, 371 
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Liouville theorem 
in complex analysis, 241 
on conformal maps in 3-space, 249 
Lobatchevsky upper half-plane, see Poincare 
upper half plane, see Poincaré upper 
half-plane, H 
locally Euclidean space, 89, 125 
locally integrable function, 2, 4, 6-8 
Lorentz knot, 368 
Lorentz-type group, x, 123, 150, 162, 163 
Lyman series of spectral lines, 120 


M 

Maass cusp form, 268, 272, 273, 276, 280, 284, 
287, 288, 307 

Maass waveform, 259, 262, 269, 276, 278, 280, 
284, 295-297, 306, 314, 361 

Maass-Selberg relations, 272 

Madelung constant, 86-88 

magnetic field, 120, 121, 123, 147 

matched load in an electrical network, 157, 158 

matrices associated wth electrical circuits, 
160 

matrix entry of a group representation, 126, 
131 

maximal compact subgroup, K, 153, 154, 178, 
181, 187, 190, 195, 343, 344, 352, 
353 

maximum principle for harmonic functions, 
270 

Maxwell’s equations, 123, 150 

mean, 26, 28, 50, 189, 191, 192, 195 

Mehler-Fock transform, 164, 169, 175-177, 
181, 355 

Mellin transform, M, 61, 62, 77, 169, 176, 182, 
183, 200, 244, 263, 292-297, 299, 
314, 321, 323-325, 339, 340, 343, 
332 

Mercer’s theorem, 54, 333, 344 

method of images, see images, method of 

microwave engineering, 156, 158, 160 

Minakshisundaram-Pleijel zeta function, 88 

minimax principle, 283 

minimum of a positive definite quadratic form 
over integer lattice, 67 

modular form, 200, 227—231, 233-235, 
238, 240, 243, 244, 253, 255, 259, 
292-294, 297-301, 303, 304, 311, 
313,315 

modular function 

associated with Haar measure, 127 
for the modular group, 240, 265, 289, 292, 

302 
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modular group, SL(2,Z), 197, 203, 205, 214, 
216, 219, 230, 238, 242, 259, 268, 
272, 289, 314 

modular knot, 367 

modular symbol, 315 

modularity theorem, 300 

moduli variety, 242 

modulus, A, 236, 249, 250 

moonshine, 242 

multiplication formula for Fourier transform, 
10, 15, 176 

multiplier system, 228, 229, 234, 238, 
243-245, 374 


N 

NaCl, see salt 

narrow class number, 345, 363 

Neumann problem, 195, 273-275, 297 

non-Euclidean distance, 152, 153, 193, 209, 
334 

non-Euclidean Eisenstein series or Epstein zeta 
function, 339 

non-Euclidean geometry, 112, 149, 150, 156 

non-Euclidean lattice point or circle problem, 
319, 333, 334 

non-Euclidean normal distribution, 188, 190, 
191 

non-Euclidean Poisson summation formula, 
333, 334, 340-344 

non-Euclidean shock wave, 316 

non-Euclidean theta function, 335 

norm in field extension, 71, 72,74, 77, 222, 
345 

norm of an element of a quaternion algebra, 
218-220 

norm of an ideal in a number field, 72 

norm of hyperbolic element of SL(2, Z), 345 

normal density, 27, 29, 186, 188, 190-192, 195 

normal or Dirichlet or Poincaré fundamental 
region, 209, 225 

nuclear magnetic resonance tomography, 147 


(0) 

O(y?), 314, 325, 336, 362 

o(y?), 271, 272, 357, 359, 362 

orthogonal group or rotation group, O(n) 
or SO(n), 83, 90, 91, 107, 108, 
111, 115, 116, 118, 120, 125, 128, 
130-134, 136, 147, 152-155, 
178-180, 182, 186, 187, 189, 190, 
193, 195, 199, 222, 327, 328, 330, 
331, 333, 334, 338, 354 
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octahedral group, 83 

orbital integral, 351, 352, 354, 356 

order in an algebra or field, 218-221, 305, 345, 
346, 363 


P 

p-adic number, 60, 82, 131, 189, 303, 375 

Paley-Wiener theorem, 16, 168, 175, 178 

parabolic element, 317, 319, 344, 345, 351, 
355, 360, 362 

parabolic fractional linear transformation, 213 

Parseval identity, 11, 32 

partitions, 238 

Pell’s equation, 265 

Penrose tiling, 97, 98 

periodic geodesic, see closed geodesic 

periodization of test function or distribution, 
42 

periodogram, 51 

perpendicular bisectors method, 208, 209 

Peter-Weyl theorem, 130, 131 

Petersson inner product, 304, 307 

Phragmen—LIndelof theorem, 325 

Picard theorem, 202, 241 

Plancherel identity, 11, 29 

Plancherel measure, 11, 129, 130, 141 

Plancherel theorem, 129, 130, 179, 188, 195 

Planck constant, 48, 118 

Poincaré polygon, see normal or Dirichlet or 
Poincaré fundamental region 

Poincaré generators and relations theorem, 
204, 211 

Poincaré group, see fundamental or Poincaré 
group 

Poincaré arc length, 151, 152, 155 

Poincaré series, 253—255, 288, 313, 321, 334 

Poincaré upper half-plane, H, x, 89, 149, 150, 
160, 221, 223 

point group, 83 

point spectrum, see discrete or point spectrum 

Poisson integral, 178 

Poisson kernel, 178 

Poisson summation formula, 39, 40, 42—45, 53, 
55, 59, 60, 80, 101, 147, 200, 230, 
246, 263, 264, 291, 316, 331-334, 
340-344, 356 

Poisson’s integral formula, 138 

polar coordinates, 5, 19, 110, 111, 114, 134, 
136, 137, 154, 155, 164, 171-174, 
187, 354 

positive definite symmetric matrix space, Pp, 
261 
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positive definite symmetric matrix space, Pn, 
63, 64, 66, 67, 69, 70, 77, 79, 81, 85, 
88, 153-155, 242, 243, 252, 256, 
257, 260, 261, 265, 269, 334, 335, 
363 

positive operator, 54 

potential theory, 178, 248, 249 

power function, ps, 165 

power function, ps, 260, 328 

power spectrum, 50 

prime geodesic theorem, 364 

prime number or ideal theorem, 76, 200, 267, 
347 

primitive hyperbolic element, 346, 347, 353, 
362, 364, 367, 374 

principal ideal, 72 

probability density, 7, 21, 25-29, 121, 142, 
188, 190, 191, 193-195 

probability distribution, see probability density 

product of distributions, 6 

projection, 97, 373 

projection-valued measure, 138 

projective linear group, 204, 276, 287, 294, 
343, 355, 374 

projective plane, 112 

projective variety, 256 


Q 

quadratic form, x, 54, 63, 79, 80, 150, 153, 154, 
163, 196, 202, 207, 208, 243, 248, 
256, 257, 267, 288, 346, 366 

quadratic number field, 73, 75, 83, 207, 241, 
273, 284, 346, 347 

quadratic reciprocity law, 245 

quantum chaos, 365 

quantum limit, 281 

quantum mechanics, 48, 58, 84, 89, 118, 123, 
128, 150, 165, 203, 237, 280, 319, 
320 

quantum number, 119, 120 

quantum-statistical mechanics, 44, 63, 318 

quasicrystal, 97, 98, 101 

quaternion algebra, 214, 217-219, 221, 317 

quiche & salad, 262 


R 

Rademacher formula for partition function, 
238 

Rademacher function, 374 

Radon transform, R, 142—144, 147, 148 

Radon-Nikodym theorem, 25 

Ramanujan T-function, 236, 237 
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Ramanujan sum, 264 

random variable, 7, 25—28, 186, 189-195 

Rankin-Selberg method, 312, 313, 322 

reciprocity law, 83, 241, 245, 265 

regular polyhedra, 147, 223 

regulator, 74-77, 79, 81 

relative Poincaré series attached to hyperbolic 
element, 254, 255 

representation of a group, see group 
representation 

representation of an integer by a quadratic 
form, 256, 257 

residual spectrum, 326 

residue of Eisenstein series, 262, 271 

residue of zeta function, 64, 76, 80 

residue theorem, see Cauchy’s residue theorem 

resolvent, 268, 326, 337, 341, 365 

resolvent kernel, see Green’s function or 
resolvent kernel 

Riemann hypothesis, 66, 67, 81, 93, 262, 283, 
284, 305, 314, 317, 325, 360 

Riemann mapping theorem, 200 

Riemann method of theta functions, 65 

Riemann sphere, 240 

Riemann surface, 200, 209, 212-214, 218, 219, 
221, 334, 347, 364, 366 

Riemann zeta function, 61, 63, 66, 75, 81, 82, 
200, 229, 244, 256, 257, 261, 265, 
274, 295, 298, 302, 360, 362 

Riemann—Lebesgue lemma, 32 

Riemann-Lebesgue lemma, 12, 13, 143 

Riemann-Roch theorem, 232, 235, 366 

Riemannian manifold, 89, 110-112, 153, 280, 
317, 349 

right G-action, 154 

right invariant integral, 127, 128, 181, 327, 330 

Roelcke-Selberg spectral resolution of A on 
L? (T\H), 321, 343 

rotation group, see orthogonal group or rotation 
group, O(n) or SO(n) 

Rydberg constant, 120 


S 

salt, 83, 86, 87 

satellites in spectroscopy, 49 

Sato-Tate distribution or Wigner semi-circle 
distribution, 238 

Schrödinger operator, 59, 89, 118—120, 128, 
237, 319 

Schur’s lemma, 126 

Schwartz function, 9 

Schwarz-Christoffel transformation, 200, 248, 
249 
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second moment, 191, 192 

Selberg trace formula, 352, 360, 363, 366, 367, 
374 

Selberg transform, 178, 329, 344 

Selberg zeta function, 365 

semidirect product of groups, 136 

semisimple Lie group, 128 

separation of variables, 36, 114, 115, 164, 167, 
173, 185, 186, 263, 338 

shah functional, 42 

Shannon sampling theorem, 36, 49, 51 

Siegel modular form, 245, 247, 343 

Siegel modular group, 343 

Siegel upper half-space, 245 

Siegel zero, 81 

singular differential operator, 62, 137, 164 

singular eigenvalue problem, 58, 137, 139, 320 

singular series, 264 

slash operator, 301 

Smith chart, 157, 158 

smoothing, 34, 35 

soliton, 257 

Soto-Andrade formula for spherical functions, 
225 

source spectral density, 49 

space group, 83, 147 

space of Schwartz functions, .7, 9 

special linear group, SL(n), 90, 150, 162, 181, 
192, 196, 197, 207, 211 

SU (p,q), 150, 156, 196 

special Lorentz-type group, SO(p,q), 150, 163, 
196 

special orthogonal group, SO(n), 108 

special unitary group, SU(n), 123 

spectral lines, 48, 49, 118-121, 128 

spectral measure, 138, 141, 168, 169, 176, 321, 
327 

spectral theorem, 20, 54, 56, 126, 130, 137, 
138, 141, 154, 331 

spectroscopy, 46, 48, 49 

spectrum, 37, 58, 93, 119, 120, 138, 196, 258, 
283, 286, 317,320; 321, 326, 3341, 
340, 351, 361, 362, 365, 366 

sphere, 5, 19, 57, 107, 108 

spherical Bessel function, 135 

spherical function, 114-116, 132, 172-175, 
192, 224, 225, 291, 303, 329, 330 

spherical harmonic, 107, 112, 114, 116-119, 
121, 123, 125, 129-137, 147, 245 

spheroidal wave function, 56 

spurious eigenvalues, 273, 274, 341 

standard deviation, 26, 28, 192 

Stieltjes, Stone, Kodaira, Titchmarsh formula, 
137, 139 
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Sturm-Liouville operator, 113, 137, 139, 140, 
164 

sun’s magnetic field, 121, 123 

support of a distribution or function, 4 

surface or Laplace spherical harmonic, see 
spherical harmonic 

surface spherical harmonic, 112 

symmetric power, 163, 237 

symmetric space, 63, 89, 108, 112, 130, 136, 
150, 173, 175, 223, 224 

symplectic group, Sp(n,R), 163, 343 


T 
table of transforms 
Helgason, 183 
Kontorovich-Lebedev, 170 
Mebhler-Fock, 177 
Mellin, 62 
Fourier, 17 
Tauberian theorem, 24, 28, 45, 267, 314, 337, 
340, 362, 364 
tautochrone, 24 
Tchebychef polynomial, 306 
tempered distribution, 15 
tensor product of representations, 128 
tessellation, 198, 204, 205, 217 
test function, 2 
tetrahedral group, 83 
theta function, 44, 64-66, 88, 200, 228, 
242-247, 250, 253, 284, 288, 292, 
299, 306 
theta group, 200, 212 
time-limited function, 55 
time-series analysis, 48, 50 
topological group, 125 
torus, R” /Z™, 41, 101, 102, 129, 196, 214, 
215, 250, 317, 347 
torus graph, 91 
totally real number field, 71, 217, 221, 304, 
305, 365 
trace formula, 352, 360-363, 365-367, 374, 
aio 
trace of an element of an extension field, 71, 
92, 289 
trace of an operator, 54, 55, 129 
transcendental number, 82, 242 
transformation formula for 
a holomorphic modular form of weight k, 
293 
eta, 238, 374 
theta, 44, 64, 88, 243, 245, 246 
theta differentiated, 66 
transmission line, 157, 158, 165, 189, 195 
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trefoil knot, 367 
twisted L-function, 301 
twisted trace formula, 300, 317 


U 

uncertainty principle, see Heisenberg 
uncertainty principle 

uniform distribution, see distribution 

unimodular group, 128, 129 

unique factorization, 70, 72,73, 75 

unit disc, 150, 156, 178, 198, 205, 217 

unit group in number field, 73, 74, 83 

unitary representation, 20, 125—130, 224, 374, 
375 

universal covering surface, 200 

unramified field extension, 241 


Vv 
value of zeta or L-function, 64, 74, 79, 82, 83 
variance, 26, 191, 192 
vector space of holomorphic cusp forms of 
weight k, Z (T,k), 231 
vector space of holomorphic modular forms 
of weight k, 4 (SL(2, Z), k), 228, 
233, 234, 240, 243, 254, 257, 269, 
292-294, 301, 303, 304 
vector space of Maass cusp forms, 
SN (SL(2, Z), à), 268, 272, 285, 
287, 314, 360, 362 
vector space of Maass waveforms, 
MN (SL(2,Z),A), 259, 261, 262, 
268-270, 295-298, 306, 307, 309 
venus spectra, 46 
vibrating 
drum, 36, 37, 39, 366 
manifold, 366 
plate, 39 
rod, 31 
string, 14 
voltage reflection coefficient, 157 
volume of fundamental domain for SL(3, Z), 
265 


von Neumann spectral theorem, 137, 138 
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WwW 

wave equation, 14, 20, 36, 38, 60, 117, 147, 
196, 317 

wave number, 48 

wavelets, 94, 96, 97 

Weierstrass continuous nowhere differentiable 
function, 105, 106 

Weierstrass function, £2, 235, 236, 247, 248, 
250, 258 

weight enumerator of a code, 253 

weight for a space of functions, 139 

weight of a holomorphic modular form, 200, 
227-233, 240, 255, 256 

weight of a representation, 129 

Weil estimate for Kloosterman sums, 93 

Weil explicit formulas, 360 

Weil-Hecke theory, 299, 300 

Weyl character formula, 128 

Weyl ergodic theorem or criterion for uniform 
distribution, 36, 101, 102, 347 

Weyl’s theorem on asymptotic distribution 
of eigenvalues of Laplacian on a 
compact domain, 45, 360 

Wiener expression for Fourier transform on R, 
59 

Wiener integral, 318 

Wiener—Khintchine formula, 50 

Wigner semi-circle distribution, 237 


Z 
Zeeman effect, 120 
zero of doubly periodic function, 246 
zeros of holomorphic modular forms, 232 
zeros of zeta and L-functions, 66, 67, 81, 262, 
273, 305, 341, 360 
zeta function 
Dedekind, see Dedekind zeta function 
Epstein, see Epstein zeta function 
from a modular form, 300 
Thara, 365 
Riemann, see Riemann zeta function 
Ruelle and Smale, 365 
Selberg, see Selberg zeta function 
zonal spherical function, 115 


